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HEPITITIS C VTRPS KPTTOPTCB 



1. Field of Invention 

10 This invention relates to specific peptide viral anti- 

gens which are immunoreactive with sera from patients in- 
fected, with parenterally transmitted non-A, non-B hepatitis 
virus (PT-NANBH, now called Hepatitis C. Virus) , to poly- 
nucleotide sequences which encode the peptides, to an ex- 

15 pression system capable of producing the peptides, and to 
methods of using the peptides for detecting PT-NANBH infec- 
tion in human sera. 
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3. Background 

Viral hepatitis resulting from a virus other than 

10 hepatitis A virus (HAV) and hepatitis B virus (HBV) has 
been referred to as non-A, non-B hepatitis (NANBH) . More 
recently, it has become clear that NANBH encompasses at 
least two, and perhaps more, quite distinct viruses. One 
of these, known as enterically transmitted NANBH or ET- 

15 NANBH, is contracted predominantly in poor-sanitation areas 
where food and drinking water have been contaminated by 
fecal matter. The molecular cloning of a portion of this 
virus, referred to as the hepatitis E virus (HEV) , has 
recently been described (Reyes et al. ) . 

20 The second NANB virus type, known as parenterally 

transmitted NANBH, or PT-NANBH, is transmitted by parenter- 
al routes, typically by exposure to blood or blood prod- 
ucts. Approximately 10% of transfusions cause PT-NANBH 
infection, and about half of these go on to a chronic 
25 disease state (Dienstag) . 

Human sera documented as having produced post-trans- 
fusion NANBH in human recipients has been used successfully 
to produce PT-NANBH infection in chimpanzees (Bradley) . 
RNA isolated from infected chimpanzee sera has been used to 
30 construct cDNA libraries in an expression vector for immu- 
noscreening with chronic-state human PT-NANBH serum. This 
procedure identified a PT-NANBH specif ic cDNA clone and the 
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viral sequence was then used as a probe to identify frag- 
ments making up 7,300 contiguous basepairs of a PT-NANBH 
viral agent (EPO patent application 88310922.5, filed 
11/18/88) . The same procedure was used by the present 
5 inventors to derive two of the PT-NANBH peptide and poly- 
nucleotide sequences disclosed herein. The sequenced viral 
agent has been named HCV (HCV) (above EPO patent applica- 
tion) 

Heretofore, one immunogenic peptide encoded by the HCV 
0 viral agent has been reported (Choo, Kuo, EPO application 
; 88310922.5) . This peptide, designated C-100, has been used 
in immunoassays of PT-NANBH sera and found to react immuno- 
, specifically with up to 80% of chronic NANBH samples, and 
about 15% of acute NANBH samples (Kuo) . 
5 It is desirable to provide one or a collection of pep- 

tide antigens which are immunoreactive with a greater per- 
centage of PT-NANBH-infected blood, including both acute 
and chronic PT-NANBH infection . 

20 4. Summary of the Invention 

It is one general object of the invention to provide 
recombinant polypeptides immunoreactive with sera from 
humans infected with hepatitis C virus (HCV) , including a 
peptide which is immunoreactive with a high percentage of 
25 sera from chronic HCV-infected . individuals, and peptides 
which are immunoreactive with sera associated with acute 
HCV infection. 

xt is another object of the invention to provide an 
HCV polynucleotide sequence encoding a sequence for recom- 
30 binant production of the peptide antigens, and a diagnostic 
method for detecting, HCV- infected human sera using the pep- 
tide antigens. 
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The invention includes, in one aspect, a peptide anti- 
gen which is immunoreactive with sera from humans infected 
with HCV. One peptide antigen in the invention includes an 
immunoreactive portion of an HCV polypeptide which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 

c) has the carboxy-terminal sequence . presented as SEQ 
ID N0:4. 

Other peptide antigens of the invention include an im- 
munoreactive portion of any one of. the following sequences: 
SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO:12, SEQ 
ID NO.-14, SEQ ID NO:16,. SEQ ID NO:18, SEQ ID NO:20, SEQ ID 
NO:22, SEQ ID NO:24, and SEQ ID NO:26.. 

In another aspect, the invention includes diagnostic 
kits for use in screening human blood containing antibodies 
specific against HCV infection. The kit includes at least 
one peptide antigen which is immunoreactive with sera from 
humans infected with hepatitis C virus (HCV) : specific 
peptide antigens for use in the kit are given above. 

One preferred embodiment of the present invention is 
a diagnostic kit containing the 409-1-1 (c-a) (SEQ ID NO: 8) 
and one of the HCV-capsid derived proteins (SEQ ID N0s:i2, 
14., 16, 18, 20, 22, 24, and 26) : two particular embodi- 
ments being 409-l-i( C -a) with the C1NC450 caps id-derived 

peptide, and 409-1-1 (c-a) with the C1NC360 caps id-derived 
peptide. 

In one embodiment of the present . invention, the 
antigen is immobilized on a solid support. The binding of 
HCV-specific antibodies to the immobilized antigen is 
detected by a reporter-labeled anti-human antibody which 
acts to label the solid support with a detectable reporter. 
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The kit is used in a method for detecting HCV infec- 
tion, in an individual by: (i) reacting serum from an HC\H 
infected test individual with the above peptide antigen, 
and (ii) examining the antigen for the presence of bound 
antibody. 

The peptide antigens are produced, in accordance with 
another aspect of the invention, using an expression system 
for expressing a recombinant peptide antigen which is immu- 
noreactive with sera from, humans infected with hepatitis C 
virus. (HCV). A selected expression vector containing an 
open reading frame (ORF) of a polynucleotide which encodes 
the peptide is introduced into a suitable host, which is 
cultured under conditions which promote expression of the 
ORF in the expression vector. 

In one embodiment, the polynucleotide is inserted into 
an expression site in a lambda gtll phage vector, and the 
vector is introduced into an E. coli host. The following 
B. coli hosts have been deposited which contain vectors 
including the coding sequences of the antigens shown in 

parenthesis: ATCC No 40901 (SEQ ID NO:3), ATCC NO. 40893 . 
(SEQID NO.-l), and ATCC No. 40792 (SEQ ID NO: 7) , and ATCC 
No. 40876 (SEQ ID NO:9) . pGEX and pET are two other 
vectors which have been used to express HCV antigens, it 
will be appreciated that determination of other appropriate 
25 vector and host combinations for the expression of the 
above sequences are within the ability of one of ordinary 
skill in the art. 

Also forming part of the invention are polynucleotides 
which encode polypeptides immunoreactive with sera from 
humans infected with hepatitis C virus (HCV) . One polynu- 
cleotide of the present invention encodes a polypeptide 
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wherein the polypeptide includes an immunoreactive portion 
of a peptide sequence which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 
5 c) has the carboxy-terminal sequence presented as SEQ 

ID NO:4; and, where the carboxy-terminal amino acid se- 
quence of said peptide antigen is encoded by the polynu- 
cleotide sequence presented as SEQ ID NO: 3. 

Other polynucleotides of the invention include any one 
10 of the following sequences: SEQ ID NO: 1, SEQ ID NO: 3, SEQ 
ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9,. SEQ ID NO:ll, SEQ ID 
NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID 
NO: 21,* SEQ ID NO: 23, and SEQ ID NO : 25 . 

These and other objects and features of the invention 
15 will become more fully apparent when the following detailed 
description is read in conjunction with the accompanying 
. drawings . 

.5- Brief Description of the Drawings 

Figure 1 illustrates the steps in producing overlap- 
20 ping linking fragments of a nucleic acid segment, in 
accordance with the methods of the present invention; 

Figure 2 shows the positions of overlap primer regions 
and linking regions along a 7, 300 basepair portion of. the 
HCV genome. 

25 Figure 3 shows the DNA coding sequence of the clone 40 

insert. The underlined sequences correspond to an R$ primer 
region. 

Figure A shows the DNA coding sequence of a clone 36 
insert. The underlined sequences correspond, respectively, 
30 to the F 7 , F 8 , and R g primer regions. 

F Figure 5 shows the DNA and protein coding sequences 
for a 409-1-1 (abcj clone insert. The "A" region of this 
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sequence is delineated by boxes, the "B" region by a box 
and a triangle, and the »c" region by a triangle and an 
asterisk. 

Figure 6 shows the DNA and protein coding sequences 
5 for a 409-1-1 (c-a) clone insert. 

Figure 7 illustrates the groups of clones which have 
been obtained from the HCV genome in the region correspond- 
ing to the 4 09 -1-1 (abc) clone insert. 

Figure 8A shows the DNA and protein coding sequences 
10 for ,the pGEX-GGl insert. The three G's above the first 

^^^^^^^ 

were made to generate the 
clone pGEX-CapA. Figure 8B shows the DNA and protein 
sequences for the pGEX-CapA insert coding sequence. The 
primers used in polymerase chain reactions to generate 

15 carboxy and amino terminal deletions are indicated below 
the nucleotide line. The sequences of the primers are 
indicated in the sense (coding strand) . The actual 
sequence of the NC (non-coding) primers is the reverse 
complement of the indicated sequence. Coding primers are 

20 underlined ; reverse (noncoding) primers are double-under^ 
lined. Sequences shown in capital letters are exact 
matches . Sequences in lowercase letters are "mismatched" 
sequences used to introduce the terminal restriction sites 
(Ncol at the 5' ends and BamHI at the 3' ends). The three 

25 nucleotides which have been altered to remove the "slippery 
codbns" at positions 24, 27, and 30 are indicated by bold 
type with, the wild type A residues shown above the se- 
quence. 

Figure 9 shows a hydropathicity plot of the HCV-core 
30 protein encoded by pGEX-CapA. The relative location of the 
primers, used to generate carboxy and amino terminal 
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deletions, are indicated relative to the protein coding 
sequence by arrows. 

Figure 10 shows an epitope map of the HCV caps id 
protein region. 

5 ■;' ■ '. v • • • • " 

6 * Detaile d Description of the Invention 

I. Definitions ■ v-- ■ .'• ' ' "■ ' 

The terms defined below have the following 
10 ; meaning herein: ■ ' .• " 

,1. "Parenterally transmitted non-A, non-B hepatitis 
viral agent (PT-NANBH)" means a virus, virus type, or virus 
class which (i) causes parenterally transmitted infectious 
hepatitis, (ii) is transmissible in chimpanzees, (iii) is 
15 serologically distinct from hepatitis A virus (HAV) , hepa- 
titis B virus (HBV) , and hepatitis E virus (HEV) . 

2v "HCV (HCV) " means a PT-NANBH . viral agent whose 
polynucleotide sequence includes the sequence of the 7 , 3 00 
. basepair region of HCV given in the Appendix, and varia- 
20 tions of the sequence , such as degenerate codons , or 
variations which may. be present in different isolates or 
w strains of HCV. 

3 . Two -nucleic acid fragments are "homologous" if 
they are capable of hybridizing to one another under 

25 hybridization conditions described in Maniatis et a2^, op. 
Sits./ PP. 320-323, using the following wash conditions: 2 
x SCC, 0.1% SDS, room temperature twice, 30 minutes each ; 
then 2 x SCC, 0.1% SDS, 50 8 C once, 30 minutes ; then 2 x 
SCC, room temperature twice, io minutes each, homologous 

30 sequences can be identified . that contain at most about 25- 
.30% basepair mismatches. More preferably, homologous 
nucleic acid strands contain 15-25% basepair mismatches, 
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even more preferably 5-15% basepair mismatches. These 
degrees of homology can be selected by using more stringent 
wash or hybridization conditions for identification of 
clones from gene libraries (or other sources of genetic 
material), as is well known in the art. 

4. A DNA fragment is "derived from" HCV if it has 
substantially the same basepair sequence as a region of the 
HCV viral genome which was defined in (2) above. 

5. A protein is "derived from" a PT-NANBH "or HCV 
virai. agent if it is encoded by an open reading frame of a 
cDNA or RNA fragment derived from a PT-NANBH or HCV viral 
agent > respectively. 

1J ' Molecular Clone Sgl^ t ion bv TTn^inoscreenin, 

As one approach toward identifying a molecular clone 
of a PT-NANBH agent, cDNA libraries are prepared from in-' 
f ected sera in the expression vector lambda gtll. cDNA se- 
quences are then selected for expression of peptides which 
are immunoreactive with PT-NANBH- infected sera. Recombi- 
nant proteins identified by this approach provide candi- 
dates for peptides which can serve as substrates in diag- 
nostic tests. Further, the nucleic acid coding sequences 
identified by this approach serve as useful hybridization 
probes for the identification of further PT-NANBH coding 
25 sequences. 

In order to make immunoscreening a useful approach for 
identifying clones originating from PT-NANBH coding sequen- 
ces, a well-defined source of PT-NANBH virus is important. 
To generate such a source, a chimpanzee (#771; Example 1A) 
30 was infected with transmissible PT-NANBH agents using a 
Factor VIII concentrate as a source (Bradley) . The Factor 
VIII concentrate was known to contain at least two forms of 
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parenterally transmitted NANB hepatitis (PT-NANBH) . In ad- 
dition to a chloroform-sensitive agent, which has subse- 
quently been called HCV (HCV) , a chloroform-resistant form 
of PT-NANBH was also , transmitted in^ the concentrate 
5 (Bradley, 1983) : 

In the method illustrated in Example 1, infected serum 
was pelleted, without dilution, by centrifugation, and cDNA 
libraries were generated from the resulting pelleted virus 
(Example IB and 1C) . Sera from infected human sources were 
10 treated in the same fashion. cDNA libraries were generat- 
ed, e.g., by a random primer method using the RNA extracted 
from pelleted sera as starting material (Example IB and 
1C) . The resulting cDNA molecules were then cloned into a 
suitable vector, for example, lambda gtll, for expression 
15 and screening of peptide antigens, and lambda gtio, for 
hybridization screening (Example ic(iv) ) . Lambda gtll is 
a particularly useful expression vector which contains a. 
unique EcoRI insertion site 53 base pairs upstream of the 
translation termination codon of the beta-galactosidase 
20 gene. Thus, an inserted sequence is expressed as a beta- 
galactosidase fusion protein which contains the N-terminal 
portion of the beta-galactosidase gene, the heterologous 
peptide, and optionally the C-terminal region of the beta^ 
galactosidase peptide (the C-terminal portion being ex- 
25 pressed when the heterologous peptide coding sequence does 
not contain a translation termination codon) . This vector 
also produces a temperature-sensitive repressor (cI857) 
which causes viral lysogeny at permissive temperatures, 
e.g., 32°C, and leads to viral lysis at elevated tempera- 
30 tures, e.g., 42°C. Advantages of this vector include: (1) 
highly efficient recombinant clone generation, (2) ability 
to select lysogenized host cells on the basis of host-cell 
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growth at permissive, but not non-permissive, temperatures, 
and (3) high levels of recombinant fusion protein produc- 
tion. Further, since phage containing a heterologous in- 
sert produces an inactive beta-galactosidase enzyme, phage 
5 with inserts are typically identified using a beta-galacto- 
sidase colored-substrate reaction. 

In the screening procedure reported in Examples 1-3, 
individual cDNA libraries were prepared from the serum of 
" one PT-NANBH infeced chimpanzee (#771) and four PT-NANBH 
10 infected humans (designated EGM, BV, WEH, and AG). These 
five libraries were immunoscreened using PT-NANBH positive 
human or chimpanzee sera (Example 2): 111 lambda gtll 
clones were identified which were immunoreactive with at 
least one of the sera. Of these 111 clones, 93 were exa- 
15 mined for insert hybridization with normal DNA. The in- 
serts were radioactively labelled and used as probes 
against Hxndlll/EcoRI doubly-digested human peripheral 
lymphocyte (PBL) DNA (Example 3) . Approximately 46% 
(43/93) of the inserts hybridized with normal human PBL DNA 
20 and were therefore not pursued . Inserts from 11 PT-NANBH- 
immunopositive clones derived from chimpanzee #771 sera 
were characterized as exogenous to normal human PBL DNA 
(Example 3). Of these 11 clones 2 PT-NANBH clones were 
identified having the following characteristics . One clone 
25 (clone 40) was clearly exogenous by repeated hybridization 
tests against normal human PBL DNA, had a relatively small 
insert size (approximately 0.5 kilobases), and was quite 
unreactive with negative control serum. The second clone 
(clone 36) was shown to be reactive with multiple PT-NANBH 
30 antisera, had a relatively large insert size (approximately 
1.5 kilobases), and was exogenous by hybridization testing 
against normal human PBL DNA. The immunoreactive charac- 
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teristics of clones 36 and 40 are summarized in Table l 
_ ■ (Example 3). Clone 36 was immunqreactive with chimpanzee 
#771 sera and two HCV-positive human sera, AG and BV. The 
clone 36 antigen did not immunoreact with the negative con- 
5 trol sera SKF. Clone 40 was immunoreactive with chimpanzee 
#771 sera and was cleanly nonreactive when the negative 
control sera was used for screening. 

The DNA sequence of clone 36 was determined in part 
and is shown in Figure 4. This sequence corresponds to 
10 nucleotides 5010 to 6516 of the HCV sequence given in the 
Appendix. The DNA sequence was also determined for the 
, clone 40 insert (Figure 3). This. sequence is homologous to 
the HCV sequence (Appendix) in the region of approximately 
nucleotides 6515 to 7070. The inserts of two other chim- 
15 panzee #771 clones, clones 44 and 45, were found to be 
homologous to clone 40 by hybridization and sequence ana- 
lysis (Example 4). The sequences for clones 36 and 40 are 
contiguous sequences, with the clone 36 sequences being 
located 5' of the clone 40 sequences as presented in the 
20 ; Appendix. Accordingly, these two clones represent isola- 
tion of a significant block of the HCV genome by the, above- 
described immunoscreening methods. 

The four lambda gtll clones 36, 40, 44 , and 45 were 
deposited in the Genelabs Culture Collection, Genelabs 
incorporated, 505 Penobscot Drive, Redwood City, CA 94063. 
Further, the lambda gtll clones of clones 36 and 40 were 
deposited with the American Type Culture Collection, 12301 
Parklawn Dr. , Rockville MD, 20852, and given the deposit 
numbers ATCC No. 40901 and ATCC 40893. 

PT-NANBH Sequence Identification by Hybridization 
Methods . 
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The polynucleotides identified in Section II can be 
employed as probes in hybridization methods to identify 
further HCV sequences, and these can then be used as probes 
to identify additional sequences. The polynucleotides can 
be directly cloned or fragmented by partial digestion to 
generate random fragments . The resulting clones can be 
immunoscreened as described above to identify HCV antigen 
coding sequences . 

To illustrate how the inserts of clones 36 and 40 can 
be used to identify clones carrying HCV sequences, the 
insert of clone 40 was isolated and used as a hybridization 
probe against the individual cDNA libraries established in 
lambda gtlO (see above) . Using the clone 40 probe approxi- 
mately 24 independent hybridization-positive clones were 
15 plaque purified (Example 5) . The positive signals arose 
with different frequencies in cDNA libraries from the 
different serum sources, suggesting that the hybridization 
signals were from the serum sources, rather than resulting 
from some common contaminant introduced during the cDNA 
synthesis or cloning (Table 2) . One of the clones, 108-2- 
5, which tested positive by hybridization with the clone 40 
insert, had an insert of approximately 3.7 kb (Example 6). 
Since it had such a large insert, clone 108-2-5 was chosen 
for further analysis. The serum source of this cDNA clone 
25 was EGM human PT-NANBH serum (Example 1). 

The insert of 108-2-5 was isolated by EcoRI digestion., 
of the lambda gtlO clone, electrophoretic fractionation, 
and electroelution (Example 6) . The isolated insert was 
treated with DNase I to generate random fragments (Example 
6) , and the resulting digest fragments were inserted into 
lambda gtll phage vectors for immunoscr eening . The lambda 
gtll clones of the 108-2-5- fragments were immunoscreened 
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(Example 6) using human (BV and normal) and chimpanzee #771 
serum. Twelve positive clones were identified by first 
round immunoscreening with the human and chimp sera . Seven 
of the 12 clones were plague purified and rescreened using 
5 chimpanzee #771 serum. Partial DNA sequences of the insert 
DNA were determined for two of the resulting clones, 
designated 328-16-1 and 328-16-2. These two clones 
contained seguences essentially identical to clone 40 
(Example 6). 

0 , The clone 36 insert can be used in a similar manner to 

probe the original cDNA library generated in lambda gtlO. 
Specific subfragments of clone 36 may be isolated by 
Polymerase chain reaction or after cleavage with restric- 
tion endonucleases. These fragments can be radioactively 
15 labelled and used as probes against the cDNA libraries 
generated in lambda gt 10 (Example 1C) . In particular, the 
5' terminal seguences of the clone 36 insert are useful as 
probes to identify clones overlapping this region. 

Further,, the seguences provided by the terminal clone 
36 insert seguences and the terminal clone 40 insert, 
seguences are useful as specific sequence primers in first- 

f 

strand DNA synthesis reactions (Maniatis et al.; Scharf et 
al.) using, for example, chimpanzee #771 sera generated RNA 
as substrate. Synthesis of the second-strand of the cDNA 
is randomly primed. The above procedures identify or 
produce cDNA molecules corresponding to nucleic acid 
regions that are 5' adjacent to the known clone 3 6 and 40 
insert sequences. These newly isolated seguences can in 
turn be used to identify further flanking sequences , and so 
on, to identify the sequences composing the HCV genome. As 
described above, after new HCV sequences are isolated, the 
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polynucleotides can be cloned and immunoscreened to 
identify specific sequences encoding HCV antigens. 

This . section describes a method for producing and 
identifying HCV peptides which may be useful as HCV- 
diagnostic antigens. The present method is used to 
generate a series of overlapping linking fragments which 
span a segment of nucleic acid. The application of the 
method to generating a series of overlapping- linking frag- 
ments which span a 7,300 basepair segment of the HCV 
genome, whose sequence is given in the Appendix, will be 
described with reference to Figures 1 and 2 . 

As a first step in the method, and with reference to 
Figure 1, the nucleic acid of interest is obtained in 
double-strand DNA form. Typically, this is done by 
isolating genomic DNA fragments or by producing cDNAs from 
RNA species present in a sample fluid. The latter method 
is used to generate double-strand DNA from NANBH viral RNA 
present in serum from chimpanzees or humans with known PT- 
NANBH infection. Here RNA in the sample is isolated, e.g., 
by guanidinium thiocyanate extraction of PEG precipitated 
virions, and reacted with a suitable primer for first 
strand cDNA synthesis. 

First-strand cDNA priming may be by random primers, 
oligo dT primers, or sequence-specific primer (s). The 
primer conditions are selected to (a) optimize generation 
of cDNA fragments which collectively will span the nucleic 
acid segment of interest, and (b.) produce cDNA fragments 
which are preferably equal to or greater than about 1,000 
basepairs in length, m one method applied to HCV RNA, the 
first-strand synthesis is carried but using sequence- 
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specif ic primers which are complementary to spaced regions 
along the length of the known HCV genomic sequence. The 
primer position are indicated at A, B, C, and D in Figure 
2, which shows a map of the HCV genome segment. The 
5 basepair locations of the primers in the HCV genome are 
given in Example 7 below. Following first strand synthe- 
sis, the second cDNA strand is synthesized by standard 
methods. 

The~~linking fragments in the method are produced by 
10 sequence-specific amplification of the double-strand DNA. 
obtained as above, using pairs of overlap-region primers to 
be described. According to an important advantage of the 
methods of the present invention, it is possible to 
generate linking fragments even when the amount of double- 
15 strand DNA is too low for direct sequence-specific amplifi- 
cation. This limitation was found , for example, with HCV 
cDNA's produced from NANBH- infected serum. Here the amount 
of double-istranded DNA available for amplification is first 
amplified nonspecif ically by a technique known as Seguence- 
20 Independent Single-Primer Amplification (SISPA). 

The SISPA technique is detailed in co-owned U.S. 
Patent application for "RNA and DNA Amplification Tech- 
niques", Serial No v 224,961, filed July 26, 1988. The 
method as applied to amplification of HCV cDNA fragments is 
25 also described in Example 7. Briefly, known-sequence 
linker primers are attached to opposite ends of double- 
stranded DNA in a DNA sample. These linkers then provide 
the common end sequences for primer-initiated amplifica- 
tion, using primers complementary to the linker /primer 
30 sequences. Typically, the SISPA method is carried out for 
20-30 cycles of amplification, using thermal cycling to 
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achieve successive denaturation and primer-initiated 
polymerization of second strand DNA. 

Figure i illustrates the SISPA amplification of duplex 
DNA, to form, amplified fragments which have known-sequence 
regions P,. As seen, the fragment mixture includes at least 
some fragments which (a) overlap at regions P, with other 
fragments in the mixture and (b) contain complete linking 
regions between adjacent p, and P l+1 regions. Collectively, 
each linking region bounded by the "associated overlap 
regions making up the segment is present in at least one 
DNA fragment. 

The production of overlapping linking fragments, in 
accordance with the methods of the present invention, is 
carried out using the polymerase chain reaction (PCR) 
method described in U.S. Patent No. 4,683,195. In practic- 
ing this step of the method, first the total segment of 
interest is divided into a series of overlapping intervals 
bounded by regions of known sequence, as just described, 
in Figure 2, the 7,300 basepair segment of the HCV genome 
has been divided into 10 intervals, each about 500-1, ooo 
basepairs in length. The intervals are designated accord- 
ing to the forward F s and reverse r, primers used in amplify- 
ing the sequence, as will be described. The selection of 
the intervals is guided by (a) the requirement that the 
basepair sequence at each end of the interval be known/ and 
(b) a preferred interval length of between about 500 and 
2,000 basepairs. 

In the method applied to the 7,300 basepair segment of 
the HCV genome, the regions of overlap between the ten 
intervals were additionally amplified, to verify that the 
SISPA-amplified cDNA sample contained sufficient HCV cDNA 
to observe PCR-amplif ication of HCV linking fragments, and 
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that HCV regions along the entire length of the genome were 
available for amplif ication. Each overlap region in the 
segment can be defined by a pair of primers which includes 
a forward primer Fj and a reverse primer R; which are 
5 complementary to opposite strands of opposite ends of the 
overlap region. The primers are typically about 20 base- 
pairs in length and span an overlap region of about 200 
basepairs. The eleven overlap regions in the HCV segment 
and. the regions corresponding to the forward and reverse 
10 primers in each region are given in Example 8. 

. The primers Fj/Rj are added to the amplified DNA 
material in a PGR reaction mix, and the overlap region 
bounded by the primers is amplified by 20-30 thermal 
cycles/ The reaction material is then fractionated, e.g., 
15 by agarose gel electrophoresis, and probed for the presence 
of the' desired sequence, e.g., by Southern blotting 
(Southern), using a radiolabeled oligonucleotide probe 
which is specific for an internal portion of the overlap 
region. As described in Example 8, this method was 
20 successful in producing amplified fragments for each of the 
eleven Fj/R; overlap regions in the HCV genome segment. The 
overlap-region fragments may be used as probes for the 
corresponding (two) linking fragments connected by the 
overlap region. It is emphasized, however, that this 
25 amplification step was employed to confirm the presence of 
amplif iable cDNA along the length of the HCV genome, and 
not as an essential step in producing the desired linking 
fragments * The step is omitted from Figure 1. 

The linking fragments Fj/Rj are produced by a two-primer 
: 30 PCR procedure in which the SISPA-amplif ied DNA fragments 
are amplified by a primer pair consisting of the forward 
primer Fj of one overlap region and the reverse primer R^ of 
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an adjacent overlap region. The ten overlap regions in the 
HCV segment and the regions corresponding to the forward 
and reverse primers in each region are given in Example 9. 
Typical amplification conditions are give in Example 9. 
The amplified fragments in each reaction mixture are 
isolated and purified, e.g., by gel electrophoresis, to 
confirm the expected fragment size. Southern blots may be 
probed with oligonucleotide probes complementary to 
internal regions located between the fragment ends, to 
confirm, the expected sequence of the fragments. As shown 
at the bottom in Figure 1, the method generates the 
complete set of linking fragments, where each fragment is 
bounded by an overlap region Pj and Pi + ,. 

The method, as applied to generating ten overlapping 
linking fragments of the 7,300 basepair HCV genome, is 
described in Example 9. As demonstrated by size criteria 
on gel electrophoresis and by sequence criteria by Southern ; 
blotting, the method was successful in generating all ten 
of the overlapping fragments spanning the HCV genome. 

It will be appreciated that the above flanking 
sequence amplification method can be applied to the 
generation of DNA fragments corresponding to the insert 
sequences of clones 36 and 40, which have also been 
obtained by immunoscr eening . The linker primers flanking 
the inserts are easily used to generate sequences corre- 
sponding to the clone inserts. For example, two-primer 
amplification of the SISPA-amplified cDNA fragments (Exam- 
ple 7) using the F, 2 /R, primer pair, (the sequences of which 
are given in Example 8) is carried out under conditions 
similar to those described in Example 9. The amplified 
fragment mixture is fractionated by agarose electrophoresis 
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on 1.0 % agarose, and the expected band cut from the gel 
and eluted. 

The purified amplified fragment is treated with the 
Klenow fragment of DNA polymerase I to assure the molecules 
5 are blunt-ended. The fragment is then ligated to EcoRI 
linkers (Example 10) . The mixture is digested with EcoRI 
and inserted into the lambda gtll vector. The resulting 
clones contain the entire coding sequences of either the 
clone 36 or clone 40 inserts. 
10 ^.Alternatively, the original amplified 36/40 fragment 

(primers F 12 /I^) is brief ly treated with Exonuclease III 
(Boehringer Mannheim, as per manufacturer's instructions) 
to generate a family of fragments with different 5' ends. 
The digestion products are treated as above and ligated 
15 into the lambda gtll vector. The resulting plaques are 
then immunoscreened. 

Further, different sets of primers, other than the 
F 12 /R9 primers described above, can be used to directly 
generate sequence encoding all, or portions, of clones 36 
20 and 40. For example, primers F 8 /R, can generate a fragment 
corresponding to a portion of the 3' sequences of the 
insert of clone 36 (Figure 4) and all of the insert 
- sequences, of clone 40 (Figure 3) . . Also, primers F 7 /R 8 can 
be used to directly generate a fragment corresponding to a 
25 portion of the 5' sequences present in the insert of clone. 
36 (Figure 4) • 

V. PT-NANB H Immunoreactive Peptide Fragments 

Several, novel peptide antigens which are immunoreac- 
30 tive with sera from human and chimpanzee NANBH-inf ected 
sera have been generated from the NANBH linking fragments 
produced above, in accordance with the methods of the 
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present invention. Further, this method has confirmed 
antigenic regions previously identified by cDNA library 
immunoscreening (Section II above) . The antigen peptides 
derived from linking fragments are preferably produced in 
a method which involves first digesting each of the above 
linking fragments with DNasel under partial digestion 
conditions, yielding DNA digest fragments predominantly in 
the 100-300 basepair size range, as illustrated in Example 
10. The digest fragments may be" size fractionated, for 
example by gel electrophoresis, to select those in the 
desired size range. 

The digest fragments from each linking fragment are 
then inserted into a suitable expression vector. One 
exemplary expression vector is lambda gtll, the advantages 
15 of which have been described above. 

For insertion into the expression vector, the digest 
fragments may be modified, if needed, to contain selected 
restriction-site linkers, such as EcoRI linkers, according 
to conventional procedures. Typically, the digest frag- 
ments are blunt-ended, ligated with EcoRI linkers, and 
introduced into UcoSJ-cut lambda gtll. such recombinant 
techniques are well known in the art (e.g., Maniatis et 
al.) . 

The resulting viral genomic library may be checked to 
confirm that a relatively large (representative) library 
has been produced for each linking fragment. This can be 
done, in the case of the lambda gtll vector, by infecting 
a suitable bacterial host, plating the bacteria, and exami- 
ning the plaques for loss of beta-galactosidase activity, 
as evidenced by clear plaques . 

The presence of a digest-fragment insert in the clear 
plaques can be confirmed by amplifying the phage DNA, using 
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primers specif ic for the regions of the gtll phage flanking 
the EcoEI insert site, as described in Example 1 OB. The 
results in Table 3 show that a large percentage of the 
plaques tested in each linking fragment library contained 
5 a digest-fragment insert. 

The linking-f ragment libraries may also be screened 
for peptide antigens which are immunoreactive with human or 
chimpanzee sera identified with PT-NANBH chronic, convales- 
cent, or acute infection. One preferred immunoscreening 

10 methpd is described in Example 10B. Here recombinant pro- 
tein produced by the phage-inf ected bacteria is transferred 
from the plaques to the filter. After washing, the filter 
is incubated with test serum, and then reacted with repor- 
ter-labeled anti-human IgG antibody. The presence of the 

15 peptide antigen on the filter is then assayed for the pre- 
sence of the reporter. As seen from Table 3, several of 
the linking-f ragment libraries were positive for immunore- 
active peptides in the primary screen. 

The immunoscreening method just described can be used 

20 to identify library plaques from each of the linking libra- 
ries which are immunoreactive with sera from human or chim- 
panzee with known chronic, convalescent, or acute PT-NANBH 
infection. One exemplary screening procedure is given in 
Example 11, where the ten HCV linking-f ragment libraries 

25 are screened with known PT-NANBH (a) human chronic serum, 
(b) chimpanzee acute pooled sera and (c) chimpanzee chronic 
pooled sera. Of the ten libraries examined, only the F } /R l0 
library did not give positive immunoreaction with any of 
the three sera. Several of the fragment libraries, inclu- 

30 ding F3/R4, F 6 /R 12 , F n /R 7 , and F 7 /R 8 showed five or more 
positive reactions with chimpanzee acute sera, indicating 
that these libraries each express one or more peptide anti- 
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gens which are useful for detecting \u • 
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related clone, designated 409-1-1 (abc) , was described in 
the parent to the present application (U. S. Application 
Ser. No. 07/505,611, herein incorporated by reference). 
Clone 409-l-l(abc) was designated 409-1-1 in the parent 
5 application. The a, b and c designations refer to three 
regions of the 409-1-1 (abc) sequence (see Figure 5). The 
5-1-1 coding sequence was isolated by polymerase chain 
reaction using oligonucleotide primers complementary to the 
ends of the 5-1-1 coding region, and cloned into lambda 
10 gtll^.for expression under induction conditions of a fused 
beta-galactosidase protein which includes the 5-1-1 antigen 
peptide region. The 5-1-1 phage was identified and plaque 
purified by similar methods. 

The 409-1-1 (c-a) and 5-1-1 antigens were compared by 
15 plaque immunoscreening with a panel of 28 sera from normal 
(2 donors), human PT-NANBH-chronic (6 donors), chimpanzee 
normal (7 donors) , chimpanzee PT-NANBH-acute (5 donors), 
and chimpanzee PT-NANBH-chronic (8 donors), with the 
results shown in Table 5 in Example 12. As can be seen in 
20 Table 5, the 5-1-1 and 409-l-l(c-a) peptides reacted with 
most of the human and chimpanzee chronic sera, although the 
409-1-1 (c-a) peptide detected a higher percentage of human 
chronic sera samples (83% vs 66%). The chronic human serum 
which was detected by the 409-1-1 (c-a) peptide, but not by 
25 5-1-1 was from a patient (BV) who died of fulminant NANBH 
infection. Because the 5-1-1 antigen is contained within 
the C-100 antigen in the commercially available kit format 
(Ortho/Chiron) , it was of interest to determine whether the 
C-100 antigen gave a broader range of reactivity with the 
30 test sera. The results are shown at the right in Table 5 
below. The only human NANBH serum that was tested was the 
above BV serum which was not detected by 5-1-1. This serum 
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was also not immunoreactive with the C-100 antigen (0/1). 
Nor was the C-100 antigen reactive with any of the five 
acute chimp sera which were tested (0/5) . It is also noted 
that the 409-l-l(c-a) antigen is inmunoreactive with 3 of 
5 the 5 acute chimpanzee sera tested, compared with only l 
out of 5 for the 5-1-1 antigen. . The results indicate that 
the 409-1-1 (c-a) antigen has broader immunospecif icity with 
PT-NANBH sera/ and thus would provide a superior diagnostic 
agent. The results obtained with 409-1-1 (c-a) are compara- 
10 ble £0 the results obtained using 409-1-1 (abc) . 

it is noted here that the 409-1-1 (abc) coding sequence 
is contained in the F^R, linking fragment and does not 
overlap the sequence of the C-100 (and 5-1-1) coding region 
which is in the F 4 /R5 and Fs/R« linking fragments. The 
relatively long coding sequence of the 409-1-1 (abc) peptide 
illustrates that larger size digest fragments- (substantial- 
ly greater than 300 basepairs) are generated in the partial 
digest step used in producing digest fragments for antigen 
. expression. 

The 409-1-1 (abc) peptide, which forms one aspect of 
the invention, has the amino acid sequence which is 
presented as SEQ ID NO: 10. The DNA coding sequence 
corresponding, to the insert in the 409-1-1 clone is given 
in Figure 5 and is presented as SEQ ID NO : 9 . 

The 409-1-1 (c-a) peptide, which forms another aspect 
of the invention, has the amino acid sequence presented as 
SEQ ID NO: 8. The DNA coding sequence corresponding to the. 
insert in the 409-1-1 (c-a) clone is given in Figure 6 and 
is presented as SEQ ID NO: 7. The relationship between the 
coding sequence of 409-1-1 (c-a) and 409-1-1 (abc) is 
outlined in Example 12. Briefly, 409-1-1 (c-a) consists of 
a carboxy terminal region of 409-1-1 (abc) moved, to the 
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amino terminus of the 409-1-1 coding sequence, with a 

truncation of the remaining 3' 409-1-1 (abc) coding se- 
quence. 

More generally, the invention includes a peptide 
5 antigen which is immunoreactive with sera from humans with 
HCV infection. . Such peptide antigens are readily identifi- 
able by the methods of the present invention. 

Antigens obtained from the region corresponding to the 
HCV sequences encoding the 409-1-1 antigens were further 

10 character- 
ized as follows. The primers shown in Table 7 were used to 
generate a family of overlapping amplified fragments 
derived from this region. Several templates were used for 
the DNA amplification reactions (Table 8). The relation- 

15 ships of the coding sequences of the resulting clones to 
each other are graphically illustrated in Figure 7. The 
amplified fragments were then cloned into lambda gtii 
vectors (Example 13) . 

These cloned fragments were then immunoscreened 
20 (Example 13) . Seven of the nine clones tested positive by 
^ preliminary immunoscreening (Table 9) . These seven clones 
were then tested against a more extensive battery of PT- 
NANBH serum samples, including numerous human clinical 
samples. The sensitivity of the antigens, in decreasing 
25 order, for reactivity with the serum used for screening was 
as follows: 33cu > 33c > 409-1-1 (c-a) > 409-1-1-F1R2 > 
409-l-l(abc) « 409-1-la > 5-1-1 > .409-1-1 (c+270) . As can 
be seen from these results all of the alternative clones, 
with the exception of 409-1-1 (c+270), provided a more 
30 sensitive antigen than 5-1-1. However, although 3 3cu and 
33c were very sensitive antigens, in this assay they 
reacted slightly with serum which was known to be negative 
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for HCV and may therefore be less specific. Accordingly, 
the 409-1-1 series appears preferable for use as diagnostic 
antigens since they are more specific to HCV-induced 
antibodies. 

5 The immunoscreening was extended to include the clone 

36 and 45 encoded epitopes: the insert of clone 45 is 
essentially the same as the insert of clone 40 (Example 4) . 
As can be seen from the results presented in Table 11, the 
antigens produced by clones 36 and 40, while not as 
10 sensitive as 409-1-1 (c-a) , do yield HCV-specific immunopo- 
sitive signals with selected samples. Accordingly, the two 
methods presented in the present invention, (i) immuno- 
screening of cDNA libraries generated directly from sera- 
derived SNA, and (ii) immunoscreening of amplif ied-fragment 
15 libraries, can both be seen to be effective methods of. 
identifying cDNA sequences encoding viral antigens. 
Further, confirmation of the clone 36 and 40 encoded 
antigens by identification of antigens corresponding to 
these HCV regions using the amplif ied-fragment library 
method validates the usefulness of the amplif ied-fragment 
method. 



B. Peptide Purification 

The recombinant peptides of the present invention can 

25 be purified by standard protein purification procedures 
which may include differential precipitation, molecular 
sieve chromatography, ion-exchange chromatography , isoelec- 
tric focusing , gel electrophoresis and affinity chromatog- 
raphy. In the case of a fused protein, such as the beta- 

30 galactosidase fused proteins prepared as above, the fused, 
protein can be isolated readily by affinity chromatography, 
by passing cell lysis material over a solid support having 
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surface-bound anti-beta-galactosidase antibody. For 
example, purification of a beta-galactosidase/fusion 
protein/ derived from 409-1-1 (c-a) coding sequences, by 
affinity chromatography is described in Example 14. 
5 A fused protein containing the 409-1-1 (a) peptide 

fused with glutathione-S-transf erase (Sj26) protein has 
also been expressed using the pGEX vector system in E. coli 
KM392 cells (Smith) . This expression system has the 
advantage that the fused protein is generally soluble and 
10 therefore can be isolated under hon-denaturing conditions. 
The fused Sj 26 protein can be isolated readily by glutathi- 
one substrate affinity chromatography (Smith) . This method 
of expressing this fusion protein is given in Example 15 
and is applicable to any of the other antigen coding 
15 sequences described by the present invention. 

Also included in the invention is an expression 
vector, such as the lambda gtll or pGEX vectors described 
above, containing the 409-i-l(a) coding sequence and 
expression control elements which allow expression of the 
20 coding region in a suitable host. The coding sequence is 
contained in the sequence given above corresponding to 
basepairs 2755-3331 of the HCV genome. The control 
elements generally include a promoter, translation initia- 
tion codon, and translation and transcription termination 
25 sequences, and an insertion site for introducing the insert 
into the vector. In the case of the two vectors illu- 
strated in Example 15, the control elements control the 
synthesis of the protein which is fused with the heterolo- 
gous peptide antigen. Such expression vectors can be 
30 readily constructed for the other antigen coding sequences 
described by the present invention. 
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The lambda gtll vectors containing the following 
coding regions have been deposited with The American Type 
Culture Collection, 12301 Parklawn Dr., Rockville MD, 
20852: the 409-1-1 (abc) coding region, designated gtll/1 
5 409-1-1 (abc), ATCC No. 40876; the 409-1-1 (c-a) coding 
region, designated gtll/4 09-1— l (c-a) ATCC No. 40792; clone 
36, designated gtll/36, ATCC No. 40901; and, clone 40, 
designated gtll/4 0, ATCC No. 40893. 

10 VTI^. Immunoreactive clo nes of i-h e HCV-Capsid Antigen 

At the 1990 Congress of Hepatology a region of the 
full-length HCV nucleic acid sequence was presented, 
nucleotide residues 325-970, containing the HCV non-coding,' 
structural core protein and envelope protein- coding sequen- 
ces as capsid parts of a polyprbtein sequence . : During the 
course of experiments performed in support of the present 
invention, the coding region that corresponds to the capsid 
protein was more clearly defined. 

Polymerase Chain Reaction primers were constructed 
from selected HCV sequence which would generate amplifica- 
tion products of nucleotides 325-970 of the full, length HCV 
genome (see Appendix) . These primers, SF2(C) and SRl(C), 
are. presented in Example 16 . The primers contained non- 
complementary sequences which encoded restriction enzyme 
cleavage sites to facilitate subsequent cloning manipula- 
tions. The primers were used in amplification reactions 
containing SISPA-amplified HCV cDNA molecules (Example 7) 
as substrate. The resulting amplification products were 
cloned into the pGEX and pET vectors (Example 16). The 
pGEX vector allows expression of inserted coding sequences 
as fusion proteins to the sj26 protein, glutathione-s- 
transf erase. Insertion into the pET vector allows expres- 
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sion of the inserted coding sequences, independent of fusion 
sequences. 

These clones were then immunologically screened using 
sera known to be reactive with HCV-antigens (Example 17) . 
Several clones in both vectors were identified which were 
immunoreactive with the anti-HCV sera (in pGEX, clones 14 , 
15, 56, 60, and 65, Example 17, Table 13) . It was observed 
that the fusion proteins which were produced from the 
clones in pGEX were smaller than expected. 

, Clone 15 was selected for scaled up production of the 
Sj26/HCV-antigen fusion protein. The fusion protein 
product (approximately 29 kd) was smaller than the expected 
fusion product (approximately 50 kd, Example 17) . Further, 
the yield of the fusion protein from this preparation was 
unexpectedly low. 

Clones 15 and 56 were chosen for nucleic acid sequenc- 
ing of the HCV-antigen containing inserts (Example 18). 
The sequences of the two clones were very similar with the 
exception that clone 15 had a termination codon starting at 
nucleotide position 126. This result suggested that the 
amino terminal 42 amino acids encoded by the HCV insert 
were immunogenic in regard to the anti-HCV sera used for ' 
immunoscreening . 

To test the suggestion that the amino terminus of the 
HCV polyprotein was antigenic, a synthetic oligopeptide was 
constructed essentially corresponding to amino acid 
residues 6-24 of Figure 8A: this peptide had very strong 
immunoreactivity with anti-HCV sera as tested by ELISA. 
PCR primers (Figure 8, CI and NC105) were designed to 
generate a clone corresponding to this region (Figure io> 
C1NC105, SEQ ID NO: 25). Three other synthetic peptides 
were tested, one of which was strongly immunoreactive with 
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anti-HCV sera (amino acid residues 47-74, Figure 8A) and 
, two which were weakly immunoreactive (amino acid residues 
39-60 and 101-121, Figure 8A) . These synthetic peptides 
confirm the presence of a strong antigenic region at the 
5 amino-terminal end of the HCV polyprotein in the capsid 
protein region. 

The sequence of clone 56, designated pGEX-GGl-5 6, is 
shown in Figure 8A and is presented in the sequence listing 
: as SEQ ID NO: 11. The sequence shows that the cone has a 

10 long, open reading frame. When production of the fusion 
protein was induced, a fusion protein smaller than the 
expected product was produced, similar in size to the clone 
,15 product. The nucleotide sequence of the clones revealed 
a region which is prone to translational frameshifting, 

15 AAAAAAAAAA (Atkins et al . , Wilson et al.). Such a nucleo- 
tide sequence may contribute to the low protein yields when 
these clones are expressed in E. coli. In an effort to 
improve the level of fusion protein expression the third 
nucleotide position of several codons through this region 

20 was changed to a G resulting in the sequence AGAAGAAGAA 
(Example 20) : . the changes had no effect on the protein 
coding sequence (amino acid residues 8-10, Figure 8A) . 
This modified insert was cloned into the pGEX vector and 
the resulting plasmid named pGEX-CapA. 

25 A hydropathicity plot was generated for the protein 

coding sequences of the insert of pGEX-GGl (Example 19, 
Figure 9) . The results of this analysis indicated that the 

; carboxy-terminal region of the. encoded protein, approxi- 
mately amino acid residues 168-182, had the potential for 

30 being a membrane spanning segment. Since it was unlikely 
that the membrane spanning segment would provide a strong 
antigen and since overproduction of proteins with these 
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regions can adversely affect the growth of bacterial cells, 
a series of carboxy terminal deletions were generated from 
pGEX-CapA (Example 20). 

To generate the carboxy terminal deletions PGR primers 
were designed to be complementary to various regions of the 
pGEX-CapA insert encoded protein. The primers used to 
generate the carboxy terminal deletions are given in Table 
14 and the location of the primers relative to the insert 
coding sequence is presented in Figure 8B. The carboxy 
terminal deletion fragments were cloned into the pGEX 
vector and Sj26/HCV-insert fusion proteins were produced. 
These fusion proteins were then screened with anti-HCV sera 
and an epitope map generated for the immunoreactive 
polypeptides (see Figure 10). Clones C1NC270, C1NC360, and 
C1NC450 all expressed high levels of the Sj26/HCV fusion 
proteins. Further, these fusion proteins all corresponded 
to the size predicted from their nucleic acid coding 
sequences. Clones C1NC520 and C1NC580 gave poor yields of 
fusion proteins suggesting that when the hydrophobic region 
of amino acid residues 168-182 is present it may in part be 
responsible for the poor protein yields previously, ob- 
tained. - 

The deletion analysis was continued to further dissect 
the antigenic regions of the pGEX-CapA encoded HCV antigen. 
A series of amino terminal deletions (primers in Table 15) 
combined with carboxy terminal deletions were generated 
using PCR primers: the locations of all the primers are 
illustrated in Figure 8B. 

The results of the deletion analysis are presented in 
Table 16 and in Figure 10. These results, combined with 
the synthetic peptide data presented above, suggest that 
the capsid protein (which comprises the N-terminus of the 
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HCV polyprotein) has two dominant immunoreactive regions. 
Both of these immunoreactive regions are useful use as 
diagnostic antigens. The region comprising the first 35 
amino acids spans one of the epitopes and the region 
spanning residues 34-90 encompasses the other strongly 
immunoreactive domain. 

In summary, all of the pGEX clones containing the N- 
terminus of the HCV polyprotein and either 34, 90, 120 or 
150 residues produced large quantities of fusion protein 
which, was shown to be efficiently recognized by HCV 
positive sera. Expression of the PCR inserts containing 
amino acid residues 34-90 was also strongly immunoreactive, 
whereas inserts encoding residues 90-120 or 90-150 were not 
immunoreactive, demonstrating that these regions were not 
recognized by human sera. This result suggests that the 
regions important for the production of recombinant 
antigens is contained between residues 1 through 90. 

Analyses of the pGEXClNC450 protein and the pET360 
protein showed that the inclusion of these antigens, in 
Western and ELISA formats permitted the identification of 
HCV positive sera which had been previously identified as 
either HCV negative or HCV indeterminate. Accordingly, the 
inclusion of these epitopes permits the generation of an 
improved screening system (Example 21). 
25 VIII. Ant i -HCV Anting a ntibodies 

In another aspect, the invention includes antibodies 
specific against the recombinant antigens of the present 
invention. Typically, to prepare antibodies, a host 
animal, such as a rabbit, is immunized with the purified 
antigen or fused protein antigen. The host serum or plasma 
is collected following an appropriate time interval, and 
this serum is tested for antibodies specific against the 
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antigen. Example 15 describes the production of rabbit 
serum antibodies which are specific against the 409-1-1 
antigens in the Sj26/409-l-l (a) and beta-galactosidase/409- 
1-1 (c-a) fusion protein. These techniques are equally 
applicable to the other antigens of the present invention. 

The gamma globulin fraction or the IgG antibodies of 
immunized animals can be obtained, for example, by use of 
saturated ammonium sulfate or DEAE Sephadex, or other 
techniques known to those skilled in the art for producing 
polyclonal antibodies. . 

Alternatively, the purified antigen or fused antigen 
protein may be used for. producing monoclonal antibodies. 
Here the spleen or lymphocytes from an immunized animal are 
removed and immortalized or used to prepare hybridomas by 
methods known to those skilled in the art. To produce a 
human-human hybridoma, a human lymphocyte donor is select- 
ed. A donor known to be infected with an HCV virus (where 
infection has been shown for example by the presence of 
anti-virus antibodies in the blood) may serve as a suitable 
lymphocyte donor. Lymphocytes can be isolated from a 
peripheral blood sample or spleen cells may be used if the 
donor is subject to splenectomy. Epstein-Barr virus (EBV) 
can be used to immortalize human lymphocytes or a human 
fusion partner can be used to produce human-human hybrido- 
mas . Primary in vitro immunization with peptides can also 
be used in the generation of human monoclonal antibodies. 

Antibodies secreted by the immortalized cells are 
screened to determine the clones that secrete antibodies of 
the desired specificity, for example, using the Western 
blot method described in Example 15. 



WO 91/15516 



PCT/US91/02370 



36 

IX. Utility 

A. Diagnostic Method and Kit 

The antigens obtained by the methods of the present 
invention are advantageous for use as diagnostic agents for 
5 anti-HCV antibodies present in HCV-infected sera; particu^ 
larly, the 409-1-1 antigens (409-1-1 (abc) , 409-l-i(c-a) , 
and related antigens (see Table 9) ; the clone 36 antigen; 
and, the clone 40 antigen and the capsid antigen. As noted 
above, many of the antigens provide the advantage over 
10 known. HCV antigen reagents 5-1-1 and C-100 in that they are 
immunoreactive with a wider range of PT-NANBH infected 
sera, particularly acute-infection sera. This is particu- 
larly true of combinations of the 409-1-1 antigens with the 
HCV-core protein antigens as described in Section Vli 
15 above. The antigens 409-1-1 (c-a) and Cap450 have been 
combined in an ELISA test kit and tested against HCV test 
kits produced by Abbott and Ortho. The antigens of the 
present invention consistently identify more HCV+ samples 
with a high degree of specif icity which is comparable to or 
20 better than the Abbott and Ortho test kits. 

In one preferred diagnostic configuration, test serum 
is reacted with a solid phase reagent having a surface- 
bound HCV antigen (or antigens) obtained by the methods of 
the present invention, e.g., the 409-1-1 (c r a) antigen and 
25 the Cap4 50 antigen. After binding anti-HCV antibody !to the 
reagent and. removing unbound serum components by washing, 
the reagent is reacted with reporter-labeled anti-human 
antibody to bind reporter to the reagent in proportion to 
the amount of bound anti-PT-NANBH antibody on the solid 
support. The reagent is again washed to remove unbound 
labeled antibody, and the amount of reporter associated 
with the reagent is determined. Typically, the reporter is 
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an enzyme which is detected by incubating the solid phase 
in the presence of a suitable fluorometric or colorimetric 
substrate. 

The solid surface reagent in the above assay is 

5 prepared by toown techniques for attaching protein material 
to solid support material, such as polymeric beads, dip 
sticks, 96-well plate or filter material. These attachment 
methods generally include non-specific adsorption of the 
protein to the support or covalent attachment of the 
10 protein, typically through a free amine group, to a 
chemically reactive group on the solid support, such as an 
activated carboxyl , hydrpxyl , or aldehyde group . 

In a second diagnostic configuration, known as a 
homogeneous assay, antibody binding to. a solid support 
15 produces some change in the reaction medium which can be 
directly detected in the medium. Known general types of 
homogeneous assays proposed heretofore include (a) spin- 
labeled reporters, where antibody binding to the antigen is 
detected by a change in reported mobility (broadening of 
20 the spin splitting peaks) , (b) fluorescent reporters, where 
binding is detected by a change in fluorescence efficiency, 
(c) enzyme reporters, where antibody" binding effects 
enzyme/ substrate interactions, and (d) liposome-bound 
reporters, where binding leads to liposome lysis and 
25 release of encapsulated reporter. The adaptation of these 
methods to the protein antigens of the present invention 
follows conventional methods for preparing homogeneous 
assay reagents. 

In each of the assays described above, the assay 
30 method involves reacting the serum from a test individual 
with the protein antigen and examining the antigen for the 
presence of bound antibody. The examination may involve 
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.attaching 'a labeled anti-human antibody to the antibody 
being examined, either IgM (acute phase) or IgG (convales- 
cent or chronic phase) , and measuring the amount of 
reporter bound to the solid support, as in the first 
5 method, or. may involve observing the effect of antibody 
binding on a homogeneous assay reagent, as in the second 
method. . 

Also forming part of the invention is an assay system 
or kit for carrying out the assay, method just described. 
10 Thejcit generally includes a support with surface-bound 
recombinant HCV antigen (e.g. , the 409-1-1 antigens, etc., 
as above) , and a reporter- labeled anti-human antibody for 

detecting surface-bound anti-PT-NANBH-antigen antibody. 

As discussed in Section III above, peptide antigens 
15 associated with several of the linking-f ragment libraries 
are immunoreactive with acute NANBH sera from chimpanzees, 
indicating that the peptides would be useful for detecting 
acute NANBH infection in human serum, in particular, one 
or more peptide antigens produced by the linking fragment 
!0 libraries, F s/R9 (reactive with chronic sera), F 3 ^, F 6 B I2 , 
*nR»# F 7 Rj, or F 7 R„ (which are shown in Example li to produce 
one or more peptide antigens which are immunoreactive with 
acute chimpanzee sera) can be combined with the 409-1-1 
antigens to provide a diagnostic composition capable of 
immunoreacting with a high percentage of both chronic and 
acute human NANBH serum samples. Further, as discussed in 
Section VII above inclusion of the HCV-capsid protein 
antigens of the present invention add an extra level of 
sensitivity. 

A third diagnostic configuration involves use of the 
anti-HCV antibodies, described in Section VI above, capable 
of detecting HCV specific antigens. The HCV antigens may 
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be detected, for example, using an antigen capture assay 
where HCV antigens present in candidate serum samples are 
reacted with an HCV specific monoclonal antibody. The 
monoclonal antibody is bound to a solid substrate and the 
5 antigen is then detected by a second, different labelled 
ant i -HCV antibody: the monoclonal antibodies of the 
present invention which are directed against HCV specific 
antigens are particularly suited to this diagnostic method. 

10 B. -Peptide Vaccine 

The HCV antigens identified by the methods of the 
present invention, e.g. 409-1-1 (c-a) and HCV-core protein 
antigens, can be formulated for use in a HCV vaccine. The 
vaccine can be formulated by standard methods, for example, 
15 in a suitable diluent such as water, saline, buffered 
salines, complete or incomplete adjuvants, and the like. 
The immunogen is administered using, standard techniques for 
antibody induction, such as by subcutaneous administration , 
of physiologically compatible, sterile solutions containing 
20 inactivated or attenuated virus particles or antigens. An 
immune response producing amount of virus particles is 
typically administered per vaccinizing injection, typically 
in a volume of one milliliter or less. 

A specific example of a vaccine composition includes, 
25 in a pharmacologically acceptable adjuvant, a recombinant 
409-1-1 (c-a) peptide. The vaccine is administered at 
periodic intervals until a significant titer of anti-HCV 
antibody is detected in the serum. Such vaccines can also 
comprise combinations of the HCV antigens of the present 
30 invention. 
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C. Passive Immunoprophylaxis 

The anti-HCV antibodies of the invention can be used 
as a means of enhancing an anti-HCV immune response since 
antibody-virus complexes are recognized by macrophages and 
other effector cells. The antibodies can be administered in 
amounts similar to those used for other therapeutic 
administrations of antibody. For example, pooled gamma 
globulin is administered at 0.02-0.1 ml/ lb body weight 
during the early incubation of other viral diseases such as 
rabies, measles and hepatitis B to interfere with viral 
entry into cells. Thus, antibodies reactive with, for 
example, the 409-1-1 (c-a) antigen can be passively adminis- 
tered alone in a "cocktail" with other anti-viral antibod- 
ies or in conjunction with another anti-viral agent to a 
15 host infected with a PT-NANBH virus to enhance the immune 
response and/ or the effectiveness of an antiviral drug. 

The following examples illustrate various aspects of 
the invention, but are in no way intended to limit the 
20 scope thereof. . 

* i 

Materials 

E. coll DNA polymerase I (Klenow fragment) was 
obtained from Boehringer Mannheim Biochemicals (Indianapo- 
25 lis, IN). T4 DNA ligase and T4 DNA polymerase were 
obtained from New England Biolabs (Beverly, MA) ; Nitrocel- 
lulose filters were obtained from Schleicher and Schuell 
(Keene, NH) . 

Synthetic oligonucleotide linkers and primers were 
30 prepared using commercially available automated oligonu- 
cleotide synthesizers. Alternatively, custom designed 
synthetic oligonucleotides may be purchased, for example, 
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from Synthetic Genetics (San Diego, CA) . cDNA synthesis 
kit and random priming labeling kits were obtained from 
Boehringer-Mannheim Biochemical (BMB, Indianapolis, IN) . 

5 Example 1 

Construction of NANB-containina cDNA libraries 

A. Infection of a Chimpanzee with HCV 
A chimpanzee (#771) was inoculated with a Factor VIII 

preparation which was known to cause parenteral ly transmit- 
[ 10 ted.non-A non-B hepatitis (PT-NANBH) in human patients 
treated with the Factor VIII concentrate (Bradley). Post- 
infection ultras tructural changes in liver tissue were 
observed by electron microscopy and ALT (alanine amino 
transferase) elevation was observed in the infected 
15 chimpanzee. These observations are consistent with PT- 
NANBH infection. 

B. Isolation of RNA from Sera 
Serum was collected' from the above described infected - 

20 chimpanzee (#771) and four human PT— NANBH clinical sources 
(EGM, BV, CC and WEH) . Ten milliliters of each undiluted 
serum was pelleted by centrifugation at 30K, for 3 hours in 
an SW40 rotor, at 4°C. RNA was extracted from each result- 
ing serum pellet using the following modifications of the 
25 hot phenol method of Feramisco et al. Briefly, for each 
individual serum sample, the pellet was resuspended in 0.5 
ml of 50 mM NaOAc, pH==4 . 8 , containing 1% SDS . An equal 
volume of 60°C phenol was added and incubated for 15 minutes 
at 60°C with occasional vortexing. This mixture was 
30 transferred to a 1.5 ml microfuge tube and spun for two 
minutes at room temperature in a table top microfuge . The 
aqueous phase was transferred to a new microfuge tube . To 
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the aqueous phase, 50 (il of 3 H NaOAc, pH=5.2, and two 
volumes of 100% ethanol were added. This solution was held 
at -70°C for approximately io minutes and then spun in a 
microfuge at 4°C for 10 minutes. The resulting pellet was 
5 resuspended in 100 jil of sterile glass distilled Water. To 
this solution 10 fil of NaOAc, pH=5.2, and two volumes of 
100% ethanol were added. The solution was held at -70°C for 
at least 10 minutes. The RNA pellet was recovered by 
centrif ugation in a microfuge at 12,000 X g for 15 minutes 
10 at 5°C. The pellet was washed in 70% ethanol and dried 
. under vacuum. 

C. Synthesis of cDNA 

(i) First Strand Synthesis 

15 The synthesis of cDNA molecules was accomplished as 

follows. The above described RNA preparations were each 
resuspended in 26 nl of sterile glass distilled water 
(treated with diethyl pyrocarbonate, Maniatis.et al. ) , 5 >1 
of 10 X reaction buffer (0.5 M Tris HCl, pH=8.5; 0.4 M KC1; 

20 o.l M MgCl 2 ; 4 mM DTT) , io /il of a nucleotide solution 
(dGTP, dATP, dTTP, and dCTP, each at a concentration of 5 
mM), 5 >1 random primer, 0.25 /xl of ^P-dCTP, 2 fil AMV 
reverse transcriptase, and 2 fil of RNASIN ( Pr omega ), in a 
total reaction volume of 50 nl. This mixture was incubated 

25 for one hour at 42°C. 

(ii) Second Strand cDNA Synthesis 

To the first strand synthesis reaction mixture the 
following components were added: 55 /il of 2 X second 
strand synthesis buffer (50 mM Tris HCl, pH=7.0; 60 mM 
KC1); 2 /il RNase H; 5 /tl DNA polymerase I, and 2 fil of the 
above described nucleotide solution. The reaction was 
incubated for one hour at 12°C, followed by a one hour' 
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incubation at room temperature . The reaction mixture was 
extracted with ah equal volume of 1:1 phenol/chloroform, 
followed by an extraction using 24:1 chloroform/ isoamyl 
alcohol. To each reaction mixture 1 /xl of 10 mg/ml tRNA 
5 was added as carrier. The cDNA was precipitated by the 
addition of two volumes of 100% ethanol and chilling at - 
70°C for 15 minutes. The cDNA was collected by centrifu- 
gation, the pellet washed with 70% ethr.no 1 and dried under 
■ ■vacuum. . 

10 ...(iii) Preparation of the Double Stranded cDNA for 

cloning 

To provide vector compatible ends each of the double 
stranded cDNA preparations was tailed with EcoRI linkers in 
the following manner. 

15 The cDNA was treated with EcoRI methylase under the 

following conditions: The cDNA pellet was resuspended in 
2 0 //I lx methylase buffer (50 mM Tris HCl, pH=7.5; 1 mM 
EDTA; 5 mM DTT) , 2 pi 0.1 mM S-adenosyl-methionine (SAM) and 
2 //l EcoRI methylase (New England Biolabs) . The reaction 

20 was incubated for 30 minutes at 37°C. TE buffer (10 mM 
Tris-HCl # pH=7.5; 1 mM EDTA, pH=?8.0) was added to achieve 
a final volume of 80 //l. The reaction mixture was extracted 
with an equal volume of phenol/ chloroform (1:1) and then 
with an equal volume of chloroform/ isoamyl alcohol (24:1). 

25 The cDNA was precipitated with two volumes of ethanol. 

To maximize the number of blunt ends for the addition 
of linkers (Maniatis et al, 1982) the cDNA was then treated 
with the Klenow fragment of DNA. polymerase I. The pelleted 
cDNA was resuspended in 11 . 5 fil of distilled water . The 

30 following components were added to the resuspended cDNA: 

4 fjl of 5 X NTB (10 X NTB stock solution: 0.5 M Trie. CI 
pH=7.2; 0.1 M MgS0 4 ; l_mM dithiothreitol- (DTT); 500 pg/ml 
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bovine serum albumin (BSA) ) ; 3 »1 o.l M MgCl 2# 1.5^,1 10GATC 
(a solution containing 10 mM of each nucleotide G, A, T, 
and C), and 1 „1 Klenow (Boehringer Mannheim Biochemicals) .' 

The reaction mixture was incubated at room temperature for 
30 minutes. The reaction mixture was extracted with 
phenol/chloroform and chloroform isoamyl alcohol as 
described above, and then precipitated with two volumes of 
ethanol. 

The cDNA pellet was resuspended in 12 //l distilled 
water. To the resuspended linkers the following components 
were added: 5 //l EcoRI. phosphorylated linkers (New England 
Biolabs), 2 pi lox ligation buffer (0.66 M Tris.Cl pH=7. 6, 
50 mM MgCl 2 , 50 mM DTT, 10 mM ATP) and 1 fil T4 DNA ligase. 
The reaction was incubated at 14 °C overnight. The follow- 
ing morning the reaction was incubated at 67 »C for three 
minutes to inactivate the ligase, then momentarily chilled. 
To the ligation reaction mixture 2.5 *1 of 10 X high salt 
restriction digest buffer (Maniatis et al.) and 2.5 jil of 
BcoRI enzyme were added and the mixture incubated at 37«c 
for, at least 6 hours to overnight. . To remove excess 
linkers the digestion mixture was loaded onto a 1.2% 
agarose gel and the reaction components size fractionated 
by electrophoresis, size fractions of the 0.3-1.3 Kb and 
1.3-7 Kb ranges were electroeluted onto NA45 paper 
(Schleicher and Schuell) . The NA45 paper, with the eluted 
cDNA bound to it, was placed in a 1.5 ml microfuge tube 
containing 0.5 ml of elution solution (50 mM arginine, 1 M 
NaCl, pH=9.0). The tube was then placed at 67 °C for 
approximately one hour to allow the cDNA to be eluted from 
the paper into the solution. The solution was then 
phenol/ chloroform, chloroform/ isoamyl alcohol extracted and 
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precipitated with two volumes of ethanol. The resulting 
cDNA pellets were resuspended in 20 >1 TE (pH=7.5). 
(iv) Cloning of the cDNA into Lambda Vectors 
The linkers used in the construction of the cDNAs 
5 contained an EcoRI site which allowed for direct insertion 
of the amplified cDNAs into lambda gtlO and gtll vectors 
(Promega, Madison WI) . Lambda vectors were purchased from 
the manufacturer (Promega) which were already digested with 
EcoRI and treated with bacterial alkaline phosphatase, to 
10 remove the 5' phosphate and prevent self -ligation of the 
. . ■' vector. . 

The £co.RJ-linkered cDNA preparations were ligated into 
both lambda gt 10 and gtll (Promega). The conditions of the 
ligation reactions were as follows: l pi vector DNA 
15 (Promega, 0.5 mg/ml) ; 0.5 or 3 pi of insert cDNA; 0.5 pi 10 
X. ligation buffer (0.5 M Tris-HCl, pH=7.8; 0.1 M MgCl 2 ; 0.2 
M DTT; 10 mM ATP; 0.5 g/ml BSA) , 0.5 pi T4 DNA ligase (New 

England Biolabs) and distilled water to a final reaction 
volume of 5 pi. 

The ligation reaction tubes were placed at l4°C 
overnight (12-18 hours) . The ligated cDNA was packaged the 
following morning by standard procedures using a lambda DNA 
packaging system (GIGAPAK, Stratagene, LaJolla, CA) , and 
then plated at various dilutions to determine the titer and 
25 recombinant frequency of the libraries. A standard X-gal 
blue/white assay was used to screen the lambda gtll 
libraries (Miller; Maniatis et al.). E. coli HG415 (from 
Howard Gersenfeld, Dept.. of Pathology, Stanford School of 
Medicine) plating bacteria, which allows only plague 
30 formation by recombinant clones, was used for plating the 
lambda gt 10 libraries. The standard strain, E. coli C600hF- 
may be used as ah alternative to E. coli HG415. 
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Example 7 

• • s ™**toq the cDNA Tihrnrv for nr-nH^^ of 
antigens 

The five lambda gtll libraries generated in Example l 
were screened for specific HCV encoded viral antigens by 
immunoscreening. The phage were plated for plague forma- 
tion using the Escherichia coli bacterial plating strain E. 
coli KM392 (Kevin Moore, DNax, . Palo Alto, : CA) . Alterna- 
tively, E. coli Y1088 may be used., The fusion proteins 
expressed by the lambda gtll clones were screened with 
serum antibodies (Young et al.) from the following sources: 
chimpanzee #771 and various human PT-NANBH sera (including 
EGM, BV, WEH and AG) . 

From the lambda gtll libraries (Example 1) approxi- 
mately m independent clones gave a positive immunological 
reaction with at least one of the chimp or human PT-NANBH 
sera. These phage clones were plaque purified and the 
recombinant phage grown for DNA purification (Maniatis et 
al.) . - 
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Example 3 

Genomic Hybridization .So-eer^ n f i mnQnns< H ve clone* 

Out of, the ill plague purified recombinant phage, 
obtained as in Example 2, 93 were isolated (Maniatis et 
al.) and digested with EcoRI as per the manufacturer's 
instructions (Bethesda Research Laboratories , Gaithersburg, 
MD). Approximately i.o microgram of each digested phage 
DNA sample was loaded into sample wells of i.o% agarose 
gels prepared using TAE (0.04 m Tr is Acetate , o.ooi M 
EDTA) . The DNA samples were then electrophoretically 
separated. DNA bands were visualized by ethidium bromide 
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staining (Maniatis et al.) . Inserts were clearly identi- 
fied for each of the 93 clones, purified by electroelution 
using NA45, and then radioactively labelled by nick 
translation (Maniatis et al.) . 
5 Human peripheral blood lymphocyte (PBL) DNA was 

restriction digested with Hindlll and EcoRl, loaded on a 
0.7% agarose gel (as above, except 10 ng of DNA was loaded 
per lane) and the fragments separated edectrophoretically . 
The DNA fragments in the agarose gels were transferred to 
10 nitrocellulose filters (Southern) and the genomic DNA 
probed with the nick-translated lambda gtll inserts which 
were prepared above. ' 

The filters Were washed (Southern; Maniatis et al.) 
and exposed to X-ray film. Forty-three of the 93 lambda 
15 clone inserts displayed a positive hybridization reaction 
with the human PBL DNA. Among the remaining inserts which 
clearly did not hybridize with the PBL DNA, were 11 inserts 
derived from chimp #771 clones which were also clearly 
immunopositive from Example 2. of these 11 clones, two of 
20 the clones had the immunoreactive characteristics summa- 
rized in Table l. Chimpanzee #771 and humans Ag, BV and 
WEH were chronimc PT-NANBH sera samples and SKF was a 
normal human serum sample. 
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■ : _ . Table l 

■ Ser ^ * Clone Designation 

36 40 

#771 + + 

AG + • 

BV + . 
WEH 



SKF 
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Clone 40 (original clone screening designation 304-12-1) 
was clearly exogenous , i.e., not derived from normal human 
DNA, as evidenced by repeated hybridization tests against 
normal human PBL DNA, and a second clone, designated clone 
5 36 (original clone screening designation 303-1-4) , was not 
only exogenous but also reactive with multiple PT-NANBH 
antisera . 

- . Example A ' '. 
10 - Sequencing nfri 

DNA sequencing was performed on clones 36 and 40 as 
described in Example 3 . Commercially available sequencing 
primers (New England Biolabs) homologous to f lanking lambda 
sequences at the 5' and 3' ends of the inserts were 
i5 initially used for sequencing. As sequencing progressed 
primers were constructed to correspond to newly discovered 
sequences. Synthetic oligonucleotide primers were prepared 
using commercially available automated oligonucleotide 
synthesizers. Alternatively, custom designed, synthetic 
20 oligonucleotides may be purchased, .for example, from 
Synthetic Genetics (San Diego, CA) . 

DNA sequences were determined for the complete insert 
of clone 40 (presented as SEQ ID NO:l and also shown in 
Figure 3); this sequence corresponds to nucleotides 6516 to 
25 7070 of the HCV genome (Appendix). Subsequently, the 
inserts present in clones 44 and 45 (2 other clones of the 
11 clones identified in Example 3) were found to cross- 
hybridize to the clone 40 insert. Partial sequencing of 
clones 44 and 45 showed that the sequences obtained from 
these two clones matched the sequence of clone 40. A par- 
tial sequence of the clone 36 insert was determined and is 
presented as SEQ ID NO: 3; the complete sequence is pre- 
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sented as SEQ ID NO: 5 and is also shown in Figure 4. The 
sequence of clone 36 corresponds to nucleotides 5010 to 
6515 given in the Appendix. 

5 • - " . . 

Example 5 

Screening of the cDNA library in lambda at 10 
The cDNA libraries in lambda gtlO, generated in 
Example 1/ were screened for the presence of sequences 
10 homologous to the clone 40 insert. 

The lambda gtlO libraries were plated at a density of 
-approximately 10 4 plaques/plate and plaques lifts were 
prepared according to Maniatis et al. Filters were indexed 
using india ink to allow alignment of the f ilters with the 
15 parent plate from which the plaque lift was performed,. The 
bacteria and phage particles were lysed, and the nitrocel- 
lulose filters were processed and baked as previously 
described (Maniatis et al. ) . The prehybridization solu- 
tion, per filter, consisted of the following: 5.4 ml 
20 prehybridization buffer (50 ml of 1 M Tris HC1, pH=7.5; 2 
ml of 0.5 M EDTA, pH=8.0; 50 ml of 10% SDS; 150 ml of 20 X 
SSC (Maniatis et al.); and, 238 ml of glass distilled 
water); 6.0 ml formamide; 0.4 ml 50 X Denhardt solution (5 
g FICOLL; 5 g polyvinylpyrrolidone; 5 g bovine serum 
25 albumin; brought to a total volume of 500 ml with glass 
distilled water) ; and 0.2 ml of single-stranded salmon 
sperm DNA (10 mg/ml) . Each filter was placed in a plastic 
bag and the prehybridization solution was added. The bag 
was sealed and incubated at 37°C overnight with intermittent 
30 mixing of contents. 

The clone 40 lambda DNA was isolated (Maniatis et al. ) 
and digested with EcoRI. The resulting fragments were 
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fractionated on an agarose gel and visualized by ethidium 
bromide staining (Maniatis et al.). The DNA fragment 
corresponding to the clone 40 insert, approximately 500 
base pairs,, was isolated from the agarose by electroelution 
onto NA45. The aqueous suspension of the purified fragment 
was extracted once with a 1:1 phenol/ chloroform solution, 
and once with a 24:1 chloroform/isoamylalcohol solution. 
The DNA was then precipitated with ethanol and resuspended 
in sterile water. 

The clone 40 . insert was radioactively labelled by nick 
translation and used to probe the lambda gtio plague lift 
.filters. The prehybri'dization solution was removed from 
the filters. Each filter was hybridized with probe under 
the following conditions: 5.0 ml of hybridization buffer 
15 (5 ml of 1 M Tris HC1, pH=7.5; 0.2 ml of 0.5 M EDTA, 
pH=8.0; 5.0 ml of 10% SDS; 14.9 ml of 20 X SSC (Maniatis et 
al.); io g of dextran sulfate; and, glass distilled water 
to a total volume of 50 ml) ; 5.0 ml formamide; 0.4 ml 50 X 
Denhardt's solution (5 g FICOLL; 5 g polyvinylpyrrolidone; 
5 g bovine serum albumin; brought to a total volume of 500 
ml with glass distilled water); and 0.2. ml of single- 
stranded salmon sperm DNA (10 mg/ml) . To this hybridiza- . 
tion mix was added 50-250 m of denatured probe (boiled 5- 
10 minutes and quick-chilled on ice) > resulting in approxi- 
25 mately io 6 cpm of labelled probe per filter. The hybridiza- 
tion mix containing the labelled probe was then added to 
• the plastic bag containing the . filters. The bag . was 
resealed and placed under a glass plate in a 37°c water bath 
overnight with intermittent mixing of contents . . 

The next day the hybridization solution was removed 
and the filters washed three times, for 5 minutes each, in 
2 X SSC (Maniatis et al.) containing 0.5% SDS, at room 
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temperature. The filters were then washed for one hour in 
2 X SSC, containing 0.5% SDS, at 50°c. The filters were 
then washed for 15-60 minutes in 0.1 X SSC, containing 0.1% 
SDS, at 50°C and finally 2 X SSC/ 15 minutes, 2-3 X at room 
5 temperature. The washed filters were dried and then 
exposed to X-ray film for detection of positive plaques. 

Approximately 24 plaques from the lambda gtio librar- 
ies were plaque purified from the approximately 200 plaques 
which tested positive by the hybridization screen (Table 

io 2) . . , - . ' ' ". 

••' ' , . ' Table 2 ' ' 

Library '. cDNA Positives/ 

15 - _Source Plate 

EGM Human s50 

BV Human = 100 

20 , 

WEH Human s25 

#771 Chimp slO-15 

Example 6 

Analysis of lambda crtlQ cD NA Library Clones Homologous ' 

to the Clone 40 insert 
30 The clones identified in Example 5 which have homology 

to the clone 40 insert were analyzed by, standard restric- 
tion analysis and the insert sizes were determined. The 
original frequencies of positive hybridization signals per 
plate using the clone 40 insert as probe against the 
35 different cDNA sources are shown in. the last column of 
Table 2. That these positive signals arose with different 
frequencies for the different cDNA sources in the lambda 
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gtlO library suggests that the hybridization signals 
originated from the sera source rather than common contami- 
nation introduced during cDNA synthesis or cloning. 

; One of the clones (108-2-5) from the EGM-generated 
cDNA library identif ied by hybridization with the clone 40 
insert, had an insert of approximately 3.7 kb and was 
chosen for further analysis. The insert was isolated by 
EcoBI digestion of the clone, electrophoretic fraction- 
ation, and electroelution. (Example ,5) .' The insert was 
: treated with DNase I under conditions resulting in partial 
digestion (Maniatis et al.) to generate random fragments. 
- - The resulting fragments were inserted into lambda gtll 
vectors for expression. The lambda gtll clones were then 
immunoscreened (Example 2) using human (BV and normal) and 
chimpanzee #771 sera. Twelve positive clones were identi- 
fied by first round immunoscreening with the human and 
chimp sera. Seven of the 12 clones were plague purified 
and rescreened using chimp serum (#771). Partial DNA 
sequences of the insert DNA were determined for two of the 
resulting clones that had the largest sequences, designated 
328-16-1 and 328-16-2. The 2, clones had sequences essen- 
" tially identical to clone 40. 

■ Example 7 • j ' ■ 

25 * Preparing Amplifi ed HCV cDNA Fragments 

A. Preparing cDNA fragments 

A plasma pool obtained from a chimpanzee with chronic 
PT-NANBH was obtained from the Centers for Disease Control 
(CDC) (Atlanta, GA) . After direct pelleting or PEG 
30 precipitation, RNA was extracted from the virions by 
guanidinium thiocyanate-phenol-chloroform extraction, 
according to published methods (Chomczynski) . The pelleted 
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RNA was used for. cDNA synthesis using oligo dT. or random 
primers, or HCV sequence-specific primers and a commercial 
cDNA kit (Boehringer-Mannheim) . 

In one method, synthesis of first strand cDNA was 
5 achieved by addition of four primers, designated A, B, c, 
and D, having the sequences shown below. These sequences 
are complementary to the HCV genomic regions indicated: 
, A: 5 ' -GCGGAAGCAATCAGTGGGGC-3 ' , complementary to . basepairs 

394-413; 

10 B : 5 ' -GCCGGTCATGAGGGCATCGG-3 ' , complementary to basepairs 
2960-2980; 

C: 5 '-CGAGGAGCTGGCCACAGAGG-3', complementary to basepairs 
5239-5258; and 

D : 5 ' -TGGTTCTATGGAGTAGCAGGCCCCG-3 ' , comp lementary to 
15 basepairs 7256-7280. 

Second strand cDNA synthesis was performed by the 
method of Gubler and Hoffman. The reactions were carried 
out under standard cDNA synthesis methods given in the 
commercial kit. 

20 . .-" .-■ : , . \\v • ,' " .. ■: ■ 

B. Amplifying the cDNA Fragments 

The cDNA from above was blunt ended and ligated to the 
linker/primer having the following sequence: 

25 Linker /primer: 5'-GGA ATT CGC GGC CGC TCG-3 ' A-strand 

3'-TT CCT TAA GCG CCG GCG AGC-5' B-strand 
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The cDNA and linker , were mixed at a 1:100 molar ratio in 
the presence of 0.3 to 0.6 Weiss units of T4 DNA ligase. 
To 100 pi of 10 mM Tris-Cl buffer, pH 8.3, containing l . 5 
mM MgCl 2 and 50 mM KC1 (Buffer A) was added about 1 x 10-3 
/jg of the linker-ended cDNA, 2 fM of linker /primer A (A- 
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strand ) having the sequence d ( 5 ' -GGAATTCGCGGCCGCTCG-3 ' ) 
200 pK. each of dATP, dCTP, dGTP, and dTTP, and 2.5 units of 
Thermus aquatjcUs DNA polymerase (Taq polymerase). The 
reaction mixture was heated to 94°C for 30 sec for denatur- 
5 ation, allowed to cool to 50°C for 30 sec for primer 
annealing, and then heated to 72°C for 0.5-3 minutes to 
allow for primer extension by Tag polymerase. The replica- 
tion reaction, involved successive heating, cooling, and 
polymerase reaction, was repeated an additional 25 times 
10 with the aid of a Perkin-Elmer Cetus DNA thermal cycler. 
This results in a pool of SISPA (sequence-independent 
: single primer amplification) -amplified DNA fragments. 

Example a 

15 . * Preparin g Primer-Pair Fragments 

Amplified cDNA fragments from Example" 7 were mixed 
with 100 //l. Buffer A, 1 (M of equal molar amounts of one of 
the primer pairs given below, 200 //M each of dATP, dCTP, 
dGTP , and dTTP , and 2.5 units of Thermus aouaticus DNA 

20 -polymerase (Taq polymerase) . Each primer pair includes a 
forward, (upstream) primer Fj which is identical to the 
coding strand at the upstream end of an overlap region P; of 
duplex genomic DNA and a reverse primer Rj which is comple- 
mentary to the coding at the downstream end of the region 

25 Pj. The sets of primers each define an overlap region of 
about 200 basepairs, and the spacing between adjacent 
overlapping primer regions (i.e., between adjacent pairs of 
F./R; pairs) is about 0.5-1 kilobase. The regions of HCV 
which are complementary to the primers are given below: 

30 F lf basepairs 183-201; R,, basepairs 361-380 
F, 0 , basepairs 576-595; R, 0 , basepairs 841-860 
F 2 , basepairs 1080-1100; R 2 , basepairs 1254-1273 
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F 3 , basepairs 1929-1948; R 3/ basepairs 2067-2086 
F 4 , basepairs 2754-2733; R«, basepairs 2920-2940 
F 5 , basepairs 3601-3620; Rj, basepairs 3745-3764 
F 6 , basepairs 4301-4320; R«, basepairs 4423-4442 
.5 F l2/ basepairs 4847-4865; R 12 , basepairs 4715-4734 
F 7 , basepairs 5047-5066; R 7 , basepairs 5200-5216 
F g/ basepairs 5885-5904; R 8/ basepairs 6028-6047 
F 9/ basepair^ 6902-6921; R,,. basepairs 7051-7070 

Polymerase Chain Reaction (PCR) amplification of the 
10 SISPA-amplif ied cDNA fragments with each Fj/Rj primer pair 
was carried out under conditions similar to those used 
above , with about 25 cycles. 

The amplified fragment mixtures from above were each 
fractionated by electrophoresis on 1.5% agarose and 
15 transferred to nitrocellulose filters (Southern) . Hybrid- 
ization of the nitrocellulose-bound fragments/ each with an 
internal-sequence oligonucleotide probe confirmed that each 
fragment contained the expected sequences. Hybridization 
was carried out with an internal oligonucleotide radiola- 
2 0 beled by polynucleotide kinase, according to standard 
methods. 

Example 9 
Prepari ng Linking Fragments 
25 This example describes preparing large overlapping 

linking fragments of the.HCV sequence. SISPA-amplif ied 
cDNA fragments from Example 7 were mixed with 100 jjI Buffer 
A, 1 fM of equal molar amounts of forward and reverse 
primers in each of the primer pairs given below, 200 pK each 
30 of dATP, dCTP, dGTP, and dTTP, and 2.5 units of Thermus 
aquaticus DNA polymerase (Taq polymerase) , as in Example 8. 
........ Each primer pair includes a forward primer Fi and a reverse 
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primer Rj, where F; is the forward primer for one overlap 
region Pi, and is the. reverse primer of the adjacent 
overlap region/ Thus each linking fragment spans two 
adjacent overlap regions. The sets of primers each define 
5 a linking fragment of about 0.5-1 kilobases. The sequences 
of the primer pairs are given in Example 8. the overlap- 
ping linking fragments of the HCV sequence (Appendix) 
spanned by each primer pair is given below: ^ : ; 
F i/ R io/ basepairs 183-860 

10 F 10 /R 2/ basepairs 576-1273 
F 2 /R 3/ basepairs 1080-2086 
F 3 /R 4/ basepairs 1929-2940 ; v 
F 4 /R5, basepairs 2754-3762 
F 5 /R<5, basepairs 3601-4442 

15 F 6 /R 12/ basepairs 4301-4865 
F 12 /R 7 , basepairs 4715-5216 
F 7 /Rg, basepairs 5047-6047 
F^/Eg, basepairs 5885-7070 

20 Two-primer amplification of the SISPA-amplified cDNA 

fragments with each Fj/Rj primer pair was carried out under 
conditions similar to those described above, with about 25 
cycles. 

The amplified fragment mixtures from above were each 
25 fractionated by agarose electrophoresis on 1.2 % agarose, 
and transferred to nitrocellulose filters (Southern) for 
hybridization with radiolabeled internal oligonucleotide 
probes as above. The analysis confirmed that each linking 
fragment contained the two end-primer sequences from 
30 adjacent overlap regions. The sequences contained in each 
of the linking fragments are indicated in the Appendix. 
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Example 10 
Preparin g Cloned Peptide Fragments 
A. DNA Fragment Digestion 

Each of the ten linking fragments from Example 9 was 
5 suspended in a standard digest buff er (0.5M Tris HC1, pH 
7.5; 1 mg/ml BSA; lOmM MnC12) to a concentration of about 
1 mg/ml and digested with DNAse I at room temperature for 
various times (1-5 minutes). These reaction conditions 
were determined from a prior calibration study, in which 
; 10 the incubation time required to produce predominantly 100- 
300 basepair . fragments was 

determined. The material was extracted with phe- 
nol/chloroform before ethanol precipitation. 

The fragments in the digest mixture were blunt-ended 
15 and ligated with EcoRI linkers. The resultant fragments 
were analyzed by electrophoresis (5-lOV/cm) on 1.2% agarose 
. " - gels, using PhiX174/HaeIII and lambda/Hindlll size markers. 
"• The 10.0-300 bp fraction was eluted onto NA45 strips 
(Schleicher and Schuell) , which were then placed into 1.5 
20 ml microtubes with eluting solution (1 M NaCl, 50 mM 
arginine, pH 9.0), and incubated at 67°C for 30-60 minutes. 
The eluted DNA was phenol/ chloroform extracted and then 
precipitated with two volumes of ethanol. The pellet was 
resuspended in 20 /xl TE buffer (0.01 M Tris HC1, . pH 7.5, 
25 0.001 M EDTA) . ^ 

B. Cloning the Digest Fragments 

Lambda gtll phage vector (Young et al.) was obtained 
from Promega Biotec (Madison, WI) . This cloning vector has 
30 a unique EcoRI cloning site 53 base pairs upstream from the 
beta-galactosidase translation termination codon. The 
partial digest fragments from each linking fragment in Part 
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A were introduced into the EcoRl site by mixing 0.5-1.0 pg 
-ScoKT-cleaved lambda gtll, 0.3-3 pi of the above sized frag- 
ments, 0.5 „1 10X ligation buffer (above) , 0.5 M l DNA ligase 
(200 units) , and distilled water to 5 ,1. The mixture was 
incubated overnight at 14«C, followed by in vitro packag- 
ing, according to standard methods (Maniatis, pp.' 256-268) . 

The packaged phage were used to infect E. coli strain 
KM392, obtained from Dr. Kevin Moore, DNAX (Palo Alto, CA) . 
Alternatively, E. coli strain Y1090, available from the 
American Type Culture Collection . (ATCC No . 37197), could be 
used. A lawn of KM392 cells infected with about 10 3 -10 4 pfu 
of the phage stock from above was prepared on a 150 mm 
plate and incubated, inverted, for 5-16 hours at 37»C. The 
infected bacteria were checked for loss of beta-galactosi- 
dase activity (clear plaques) in the presence of x-gal 

using a standard X-gal substrate plague assay method 
(Maniatis) 

Identification of single plagues containing a digest- 
fragment insert was confirmed as follows. Clear single 
plaques (containing the progeny of a single phage) were 
removed from the plate and suspended in extraction buffer 
(Maniatis) to release the phage DNA. The phage extract was 
added to the above DNA amplification mixture in the pre- 
sence of primers which are about 70 basepairs away in ei- 
ther direction from the EcoRl site of . lambda gtll. Thus 
phage containing a digest- fragment insert will yield an 
amplified digest fragment of about 140 basepairs plus in- 
sert. Phage DNA amplification was carried out as described 
above, with 25 cycles of amplification. The reaction mate- 
rial from each plaque tested was fractionated on 1.5% aga- 
rose, and examined for the size of amplified digest frag- 
ments. Non-recombinant phage gave a 140 basepair band, and 
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recombinant phage, a band which is 140 basepair plus the 
" insert sequence in size. The results are shown in column 
2 (REC Freq) of Table 3 below, for the six linking-fragaent 
libraries indicated in the -first column in Table 3 below. 
5 The denominator in the column-2 entries is the total number 
of plagues assayed by primer amplification. The numerator 
is the number of clear plaques containing fragment inserts. 
Thus, 3/15 means that 3 plaques tested positive by PCR out 
.. of a total of 15 clear plaques assayed. 
10 . Table 3 

• Library 1 REC Freer 2 1° Screen 3 PA /REC 4 

F2R3 #2 3/15 2 , 0.33 

. F3R4 #1 7/12 o 

15 F4R5 #3 9/10 . 10 0 37 

F5R6 #5 11/12 3.7 1.35 

F7R8 #7 0/12 l _ 

F8R9 #10 3/12 58 7.73 

20 '-Libraries constructed by partial DNasel Digestion of indicated 

linking clone 

V Recombinant frequency determined by PCR with insert flanking lambda 
gtll primers 

' ■■ '".Primary screening with chronic human PT-NANBH serum (1:100) on 1.5X10 
Zo phage , 

4 - PA/REC indicates the number of positive areas detected per actual 
number of recombinant plated ■ 
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The library of digest fragments constructed for each 
linking fragment was screened for expression of peptides 
which are immunoreactive with a human PT-NANBH serum. The 
lawn of phage-infected bacteria was overlaid with a nitro- 
cellulose sheet, transferring PT-NANBH recombinant peptides 
from the plaques to filter paper. The plate and filter 
35 were indexed for matching corresponding plate and filter 
positions. 

The filter was removed after 6-12 hours, washed three 
times in TBS buffer (10 mM Tris, pH 8.0, 150 mM NaCl) , 
blocked with AIB (TBS buffer with i% gelatin) , washed again 
40 in TBS, and incubated overnight with of antiserum (diluted 
to 1:100 in AIB, 12-15 ml/plate). The sheet was washed 
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twice in TBS and then incubated with alkaline-phosphatase- 
conjugated anti -human IgG to attach the labeled antibody at 
filter sites containing antigen recognized by the anti- 
serum. After a final washing, the filter was developed in 
5 a substrate medium containing 33 „1 NBT (50 mg/ml stock 
solution maintained at 4°C) mixed with 16 M l BCIP (50 mg/ml 
stock solution maintained at 4»C) in 5 ml of alkaline phos- 
phatase buffer (loo mM Tris, 9.5, loo mM NaCl, 5 mM Mgci2) . 
Reacted substrate precipitated at" points of antigen produc- 
10 tion, as recognized by the antiserum. 

The total number of plagues which showed antigen-posi- 
tive reaction (positive areas PA) in the primary screen are 
given in the third column in Table 3. The fourth column in 
the table is the frequency of positive areas per total num- 
ber of recombinant phage screened (x 10 3 ) . This last column 
is therefore a measure of the relative immunogenicity of 
antigen expressed from a particular linking fragment using 
this particular serum sample. 

20 . Example n 

Screening Dicresfh Fragments 

The digest-fragment libraries of each of the ten link- 
ing fragments from Example 9 were screened with sera from 
a human patient with chronic PT-NANBH and with pooled sera 
5 from chimpanzees with acute PT-NANBH infection and chronic 
PT-NANBH infection. Individual chronic and acute chimpan- 
zee sera from 5 chimpanzees were obtained from the Centers 
for Disease Control. 

The digest-fragment libraries from the linking frag- 
10 ments indicated in Table 4 below were screened with each of 
the three sera, using the screening procedure described in 
Example 10. The total number of positive areas observed in 
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each plate (making up one fragment library) is given in the 
table. The entries in the table which are not in parenthe- 
ses represent the number of positive areas which were 
confirmed by plague purification, i.e., by replating 
5 plagues from the positive areas at low dilution and con- 
firming a positive area (secondary screen) . Typically 
about 90-95 percent of the positive . areas in the primary 
screen tested positive by secondary screening. The entries 
in parentheses indicate positive areas which have not been 
10 confirmed in a secondary screen. 

As seen from Table 4, all but one of the linking frag- 
ment libraries contained seguences encoding peptide 
antigens which are immunoreactive with either chronic human 
or chimpanzee infected sera. Five of the libraries contain 
15 seguences encoding antigens which are immunoreactive with 
acute sera, indicating that one or more of the antigens in 
this group are effective to detect acute-infection serum. 
Three of these latter libraries — Fj/R,, F 12 /R„ and F 7 /R g — 
gave over 10 positives in each library. These data are not 
corrected f or the recombinant f reguency in . a particular 
library and therefore do not reflect the comparative 
-immunogenicity of the various linking fragments. 
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Table 4 

Acute Pool 
P.P. Clones 



F1R10 


0 


F10R2 


4 ■ 


F2R3 


4 


F3R4 


0 


F4R5 


5 


F5R6 


34 


F6R12 


(400) 


F12R7 


2 


F7R8 


0 


F8R9 


60 



Chronic Pool 
2 . P . Clones 



0 
2 
o 

10 
0 
0 

. 5 
17(200) 
20 
0 



( ) 
P.P. 

Acute Pool 

Chronic Pool 



not plaque purified 
Plaque Pure 
CDC Panel of Chimps 
CDC Panel of Chimps 



0 . 
. 4 : 

1 
10 
7 

(42) 
10(200) 
. 9(200) 
10(130) 

r 



25 



30 



35 



A. 



Example i? 

Immunoscreeninrf f or AnQ-l-i-Antiq gn 

Plaque Immunoscreening 

Several clear plaques identified in the primary screen 
of the F 4 /Rj linking fragment were replated and plaque puri- 
fied. One of the purified plaques was designated gtll/409- 
l-l(c-a). The digest fragment contained in clone 409-1- 
l(c-a) corresponds to two sets of base pairs present in the 
HCV genome and present in clone 409-1-1 (abc) . For ease of 
reference three regions (a, b, and c, ) have been designated 
in the 409-1-1 (abc) clone (see below and Figure 5). The 
longest homology of base pairs corresponds approximately to 
nucleotides 2754 to 3129 of the Appendix (the -a" region, 
see Figure 5, region delineated by boxes) and the shorter 
homology corresponds approximately to nucleotides 3242 to 
3311 of the Appendix (the «c" region, see Figure 5): nor- 
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. mally the "c M region is located approximately 112 nucleo- 
tides distal the 3' end of the "a"' region (see Figure 5). 
The complete sequence of the gtll/409-l-l(c-a) insert is 
given in Figure 6 and presented as SEQ ID NO: 7. This clone 
5 arose through a ligation event between two independent 
DNasel fragments generated from the F 4 /R, linking clone and 
has ATCC No. 40792. A related clone, designated 409-1- 
l(abc), has been described in co-owned patent application 
'"" Ser. No. 505,611 and has ATCC No. 40876. 

10 A lambda gtll clone corresponding to the immunoreac- 

tive sequence reported in the EPO application 88310922.5, 
and designated 5-1-1, was prepared by primer-specific 
amplification of the amplified cDNA fragments generated in 
Example 7. The 5-1-1 sequence corresponds to basepairs 
15 3730-3858 of the HCV sequence (Appendix), in the linking 
fragment Fs/R^. The primers used for fragment amplification 
are 20 basepair oligomers complementary to the forward and 
reverse sequences of the 3732-3857 basepair 5-1-1 sequence. 
Both oligomers have EcoRI sites incorporated into their 
.20 ends and the forward oligomer is designed to ensure a con- 
tiguous open reading fram with the beta-galactosidase gene. 
The amplified 5-1-1 sequence was purified by agarose gel 
electrophoresis, and cloned into lambda gtll phage. Ampli- 
fication and cloning methods were as described above. 
25 -Phage containing the 5-1-1 sequence were identified and 
; purified by primary and secondary screening, respectively, 
with human PT-NANBH serum, also as described above. 

The purified gtll/409-l-l(c-a) and gtll/5-l-l clones 
were each mixed with negative lambda gtll phage, plated and 
30 immunoscreened with a number of different donor sera from 
..normal and NANBH-inf ected humans and chimpanzees, as indi- 
cated in Table 5 below. Each plate was divided into seve- 



WO 91/15516 



PCI7US91/02370 



10 



15 



64 

ral equal-area sections, and the corresponding sections on 
the nitrocellulose transfer filter were separately screened 
with the donor sera indicated, using the immunoscreening 
method described in Example 11. The number of positives 
detected for each group of sera by the 5-1-1 and 409-1-1 
(c-a) peptides are shown, as well as a comparison with the 
C-100 test in the ELISA format, in Table 5. 

Table 5 .. - 
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B. Western Blot Screening • 

For Western blot screening, gtll/409-1-1 (c-a, phage from Example 
11 was used to infect E. coli BNN103 temperature-sensitive bacteria. 
These bacteria were obtained from the American Type Culture Collection. 
The bacterial host allows expression of a beta-galactosidase/peptide 
antigen fused protein encoded by the vector under temperature induction 
conditions (Hunyh) . 

Infected bacteria were streaked, grown at 32"C overnight or until 
colonies were apparent, and individual colonies were replica plated and 
examined for growth at 32°c and 42°C. Bacterial colonies which grew at 
32-C, but not 42-C, indicating integration of the phage genome, were 
used to inoculate 1 ml of NZYDT (Maniatis) broth A saturated overnight 
bacterial culture was used to inoculate, a 10 ml culture, which was 
incubated with aeration to an O.D. of about .2 . to .4, typically 
requiring 1 hour incubation. The culture was then brought to 43" C 
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quickly in a 43«C water bath and shaken for 15 minutes to induce lambda 
gtll peptide synthesis, and incubated further at 37»C for 1 hour. 

The cells were pelleted by centrifugation, and 1 ml of the 
pelleted material was resuspended in 100 „1 of lysis buffer (62 mM 

5 Tris, pH 7.5 containing 5% mercaptoethanpl , 2.4 % SDS and 
10% glycerol). Aliguots (about 15 M l) were loaded directly 
onto gels and fractionated by SDS-PAGE. After electropho- 
resis, the fractionated bands were transferred by electro- 
elution to nitrocellulose filters, according to known" 
10 methods (Ausubel et al. ) . 

The lysate was treated with DNasel to digest bacterial 
DNA, as evidenced by a gradual loss of viscosity in the. 
lysate. An aliquot of the material was diluted with Triton 
X-l00™and sodium dodecyl sulfate (SDS) to a final concen- 
15 tration of 2% Triton X-100™ and 0.5% SDS. Non-solubilized 
material was removed by centrifugation and the supernatant 
was fractionated by SDS polyacrylamide electrophoresis 
(SDS-PAGE) .PAGE, A portion of the gel was stained, to 
identify the peptide antigen of interest, and the corre- 
sponding unstained band was transferred onto a nitrocellu- 
lose filter.. 

The 5-1-1 antigen coding sequence (Example 11) was 
also expressed as a glutathione-S-transf erase fusion 
protein using the pGEX vector system, according to pub- 
lished methods (Smith). The fusion protein obtained from 
bacterial lysate and fractionated by SDS-PAGE were trans- 
ferred to a nitrocellulose filter for Western blotting, as 
above. 

Western blotting was carried out substantially as 
described in Example 10. Briefly, the filters were blocked 
with AIB, then reacted with the serum samples identified in 
Table 5, including human and chimpanzee normal, chronic 
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NANBH, and hepatitis B (HBV) sera sample. The presence of 
specific antibody binding to the nitrocellulose filters was 
assayed by further immunobinding of alkaline-phosphatase 
labelled anti-human IgG. The results of the Western blot 
analysis with the Sj26/5-l-i fusion protein and /409-l-i (c - 
a) fusion proteins are shown in Table 6 . The data confirm 
that 409-1-1 (c-a) and 5-1-1 peptide antigens are specifi- 
cally immunoreactive with human and chimpanzee NANBH 
antisera . 

■ Table 6 
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# Positive 



Human 


Normal 


2 


Human 


NANB 


7 


Human 


HBV 


1 


Chimp 


Normal 


5 


Chimp . 


NANB 


6 


Chimp 


HBV 


1 



Sj26 
5-1-1 

0 

5 

0 

0 

5 

0 



£-gal 
409-1-1 i c-a) 

0 . 

5 . 

0 

0 

5 

0 . 



Example 13 ; " ' 
Generation of Al ternative Clones 
Alternative clones were generated from the region iden- 
tified in Example 12 as encoding antigen specif ically immu- 
noreactive with human and chimpanzee NANBH antisera. The 
primers shown in Table 7 were selected from the HCV or 409- 
1-1 (abc) coding sequences to generate a variety of over- 
lapping clones. 
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grimer 

33C-F1 
33C-R1 

409-1-1 (c-a)F 

409-1-1-F1 

409-1-1-R2 

409-1-1-F3 

409-1-1-R5 

409-l-l(c-a)R 

409-1-1CR 



67 
Table 7 



Sequence ' 

CCGAATTCGCGGTGGACTTTATCCCTGT 

CCGAATTCCAGAGCAACCTCCTCGATG 

CCGAATTCCGCACGCCCGCCGAGACTAC 

CCGAATTCTCCACCACCGGAGAGATCCC 

CCGAATTCCACACGTATTGCAGTCTATC 

CCGAATTCGTCACCCAGACAGTCGAT 

CCGAATTCCCCTCCCAAAATTCAAGATGG 

CCGAATTCGCCAGTCCTGCCCCGACGTT 

CCGAATTCGTCCTGGCACACGGGAAG 
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The primers shown in Table 7 were used in DNA amplifica- 
tion reactions as described in Examples 7B and 8: the pri- 
mers and templates used in each reaction are shown in Table 
8. The amplified fragments were then treated with the 
Klenow fragment of DNA polymerase I, under standard condi- 
tions (Maniatis et al.)/ to fill in the ends of the mole- 
cules. . The blunt-end amplified fragments were digested 
with EcoRI under standard conditions and cloned into lambda 
gtll expression vectors essentially as described in Example 
10B. The resulting inserts are. aligned for comparison in 
Figure 7. 
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Generated Fragment 

33C 
33CU 

409-1-1 (F1R2) 
409-l-l(a) 
409r-l-l{c) 
409-1-1 (c+270) 
409-1-lu 



Table 8 

Template 

cDNA" 
cDNA" 

gtll 409-1-1 (c-a) 
gtll 409-1-1 (c-a) 
gtll 409-1-1 (c-a) 
gtll 409-1-1 (c-a) 
gtll 409-1-1 (c-a) 



Primers • 

33-C-F1 and 409-1-1-R2 
33-C-F1 and 33-C-R1 
409-1-1-F1 and 409-1-1-R2 
409-1-1-F1 and 409-1-lcaR 
409-1-lcaF and 409-1-1CR 
409-1-lcaF and 409-1-1-R2 
409-1-1-F3 and 409-1-lcaR 



* Amplified cDNA fragments from Example 7 
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Example ;i3 

• Immunoscreening of the Alternative Clones 

The alternative clones generated in Example 12 were im- 
munoscreened essentially as described in Example 10B. 
5 Clones 409-1-1 (abc) and 409-1-1 (c-a) , generated in Example 
12, were also included in the following immvmoscreenings.. 
The results of the preliminary immunoscreening are shown in 
Table 9. 

10 • :' Table 9 

GLI-1 ' FEC 



33C 

15 33 

409-1-1 (abc) + 

409-1-1 (F1R2) + 

409-1-1 (a) , + 

409-1-1 (ca) + 

20 409-1-1 (C) - 

409-1-1 (c+270) + 

409-1-1 u - 



+ ND* 

ND 
ND 
ND 
ND 
ND 

ND 



The GLI-1 sera was a human chronic PT-NANBH sera. If a 
clone tested negative with GLI-1 it was further examined by 
screening with FEC, a human chronic PT-NANBH sera. 

The seven of the 9 alternative clones which tested posi- 
tive by the above preliminary immunoscreening were more ex- 
tensively screened against a battery of sera. In addition, 
clone C100 (see Background) was included in the screening. 
The results of this more exhaustive screening are presented 
in Table 10. 
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Table 10 



Serum 



ANTIGEN 











Ana. i 1 

•K/5J- 1-1 


-♦Q9-1-1 


409-1-1 


409-1-1 


409-1 


-1 5 




CI 00 33C 


OOL>U 


a oc 


FIR2 


a 


c + 270 


ca 


SKF(-) 










* 


* 


* 


■ — 


FEC( + ) 


"f* 


+ O 


* + 3 


+ 1 


+ 2 




• 


+ 2 


BV 




+ 2 


+ 3 


i 

■ 1 


+ 1 


+ 1 


- t 


+ 1 , 


- 


Bar 




+ 2 


+ 2 


1 


* 


• - _ 






- 


OOt \ 

- rri-} 




* 








* 




- 


- 


Ar 




+ 1 






■ ■ 






1 " 




Cr 


T. 


+ 2 






+ 3 


+ 3 


\ 


+ 3 

" ■ 


\ +2 


1 
2 


* 
* 






* 


• 
* 


* 


• 


" ■ 
* 


* 


3 












* 


- * 


• 


1 


4 ' 








1 


I 


J . - 


- . 




1 


, 5 


• 


• 


+ 1 




* 




* 


* 


* 


6 




+ 1 


+ 3 


+ 1 


+ 1 


+ 1 




+ 1 


+ 1 


7 




+ 2 


+ 3 


+ 1 


+ 2 


+ 2 




+ 2 


+ 1 


38 






1 


+ 1 


1 


1 






1 


39 






+ 1 


J 


+ 1 






1 


1 


40 


+ 


+ 1 


+ 2 


+ 1 


+ 1 




1 


+ 1 . 


+ 1 


41 


+ 


+ 2 


+ 3 


+ 1 


+ 1. 


+ 1 




+ 2 


+ 1 . 


42 




+ 2 


+ 3 


+ 1 


+ 1 


+ 1 




+ 2 


+ 1 


43 




















44 




I 


I 


• 


* 


> 

* 






• 


45 


• * 


I 


+ 1 


I 


I 


* 






I 


46 


+ 


+ 1 


+ 2 


+ 1 


+ 2 


+ 1 




+ 1 


I 


47 


+ ; 


+ 1 


+ 2 


+ 2 . 


+ 2 * 


+ 3 




+3 


+1 


B18 




+3 


+3 


+1 


+3 


+3 ■'. 




+3 




A7 




+3 


+3 




+1 


+3 




+3 


+3 


C7 




+2 


+3 














A3 




+3 


+3 


+1 


+2 


+1 




+2 




B7 




+2 


+3 


I . 


+3 


+3 




; +3 


I 


C12 


+ 


+2 . 


+3 















The serum samples used for screening were identified as 
follows: SKF, PT-NANBH negative; FEC, PT-NANBH positive; 
BV, community acquired NANBH; Bar, PT-NANBH positive; PP 
(pre-inoculation pooled chimpanzee serum) , . PT-NANBH 
negative; AP (acute HCV pooled chimpanzee serum), PT-NANBH 
positive; and, CP (chronic HCV pooled chimpanzee serum) PT- 
NANBH positive. The numbered serum samples correspond to 
human clinical serum samples which were PT-NANBH positive. 
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The PP, CP, and AP sera were pooled sera samples from 5 

different chimpanzees: the chimpanzee serum samples were 

obtained from the Centers for Diseas.e Control. The scoring 

system presented in Table 10 is a qualitative scoring 

system defined as follows: (-) , a clear negative ; (+) , 

(2+), (3+), increasing strength of. positive signal, 

with (3+) being the strongest signal; and (I)' stands for 

Indeterminate, where two readings were different and not 
•repeated.. 

In view of the data presented in Table 10 the sensitivi- 
ty of the antigens in terms of immunoscreening is 33 cu > 

33c > 409-1-1 (c-a) > 409-1-1-F1R2 > 409-1-1 (abc) * 409-1-la 
> 5-1-1 > 409-1-1- (c+270). Although 33 cu and 33c were sen- 
sitive antigens, they reacted with high background against 
all sera. Accordingly, the 409-1-1 series are more useful 
as diagnostic antigens since they are more specif ic to HCV 
induced antibodies. 

The immunoscreening was further extended to include the 
clone 3 6 and 45 (corresponds to clone 40) encoded epitopes 
20 which were identified above. Table 11 shows the results of 
the immunoscreening. 



Table 11 

PANEL I: SEROCONVERSION SPECIMENS 



ANTIGEN 

C-100 33C 5.1.1 409-1-1 36 45 gtll 

„_ ■ ' ( c-a) 

gj" 1 + 2+ T+ "3+ T- 

FEC + 4+ 3+ 4+ 3+ 



10 BV . - . 3+ 

SKF(norm) - - 

1- N01/D69 - I 

2- "/D124 - + 



3+ 



3- W /D146 

4- »/D211 - 

5- N00/D22 
5 6- M /D29 

7- "/D41 

8- «/D60 , - 

9- "/D137 ■ + 

10 10-N240/D0 



15 



11- 


"/D45 


12- 


H /D71 - 


13- 


"/D89 


14- 


"/D106 - 


15- 


"/D155 - 



I 


— • 




+ 


. — 


_ 


+ ' 


I 


I 


2+ 


+ . 


2+ 


3+ 


2+ 


3+ 


4+ ' 


3+ 


4+ 


4+ 


'4+ 


4+ 


I 




I 








I 




T 


I 







I 
I 



16-N228/D0 - 


'I 






17- »/D3i - : 


I 






18- »/D41 - 


I 






20 19- «'/D51 - 


I 






' 20- "/D73 


I 






21- "/D93 - 








22- "/D127 - 









25 23-N192/D114 

24- "/D184 

25- "/D224 

26- "/D280 

30 27-N176/D0 
. 28- "/D66 

29- "/D77 

30- W /D94 

31- "/D200 

35 

32- N170/D0 • 

33- "/D27 ■ 

34- »»/D49 - 
40 35- "/D64 

36- »/D183 - 

37- »/D278 - 
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SERUM 

gtll 



C-100 33C 5.1.1 



ANTIGEN 
409-1-1 



36 45 



49-N31/D0 



50- 
51- 
52- 
53- 
54- 



56- 



"/D140 
"/D154 
"/D170 
"/D210 
"/D266 
"/D336 
"/D394 



57- N16/D0 

58- »/D47 



59- 
60- 
61- 
61- 
63- 
64- 



"/D62 

"/D83 

"/D137 

"/D167 

"/D197 

M /D370 



38-N144/D63 




I 










39- »/D72 




I 










40- "/D91 




2+ 




2+ 






41- M /D289 


+ 


4+ 


+ 


3+ 


2+ 




42- "/D233 




4+ 


.3+ 


4+ 


2+ 




43-N122/D0 




I 










44- "/D51 




I 


I ... 


■ : I 




i- 


45- »»/D57 




2+ 


1 . 








46- n /D72 


+ 


2+ 




3 + 


I 




47- " "/D94 


+ 


. 3+ 


'.+ • 


4+ 


'+ 




48- "/D199 


+ .. 


4+ 


2+ 


4 + 


+ 





40 



45 



The screening sera GLI-l, FEC, BV, and SKF have been 
defined above. The numbered sera samples correspond to 
human clinical serum samples which were PT^NANBH positive: 
these samples were obtained from Dr. Francoise Fabiani- 
Lunel, Hospital La Pitie Salpetriere, Paris, France. As 
can be seen from the results presented in Table 11, the 
antigens produced by clones 36 and 40, while not as sensi- 
tive as 4 09-1-1 (c-a) , do yield HCV-specific immunopositive 
signals. 



WO 91/15516 PCT/US91/02370 



73 • 

Example 14 
Isolati on of 409-1-1 Fusion Protein 
Sepharose 4B beads conjugated with anti-beta galacto- 
sidase were purchased from Promega. The beads were packed 
5 in 2 ml column and washed successively with phosphate- 
buffered saline with 0.02% sodium azide and 10 ml TX buffer 
(10 mM Tris buffer, pH 7.4, 1% aprotinin) . 

BNN103 lysogens infected with gtll/4 09-1-1 (c-a) from 
Example 12 were used to inoculate 500 ml of NZYDT broth. 
10 The culture was incubated at 3 2°C with aeration to an O.D. 

of about .2 to .4, then brought to 43° C quickly in a 43°C 
water bath for 15 minutes to induce gtii peptide synthesis, 
and incubated further at 37°C for l hour. The cells were 
pelleted by centrifugation, suspended in 10 ml of lysis 
15 buffer (10 mM Tris, pH 7.4 containing 2% Triton X-100™ and 
1% aprotinin added just before use. The resuspended cells 
were frozen in liquid nitrogen, then thawed, resulting in 
substantially complete cell lysis. The lysate was treated 
with DNasel to digest bacterial and phage DNA, as evidenced . 
20 by a gradual loss of viscosity in the lysate. Non-solubi- 
lized material was removed by centrifugation. 

The clarified lysate material was loaded on the 
Sepharose column, the ends of the column were closed, and 
the column was placed oh a rotary shaker for 2 hrs. at room 
25 temperature and 16 hours at 4°C. After the column settled/ 
it was washed with 10 ml of TX buffer. The fused protein 
was eluted with 0.1 M carbonate/bicarbonate buffer, pHlO. 
A total of 14 ml of the elution buffer was passed through 
the column, and the fusion protein eluted in the first 4-6 
30 ml of eluate. 

The first 6 ml of eluate from the affinity column were 
concentrated in Centricon™-30 cartridges (Amicon, Danvers, 
Mass . ) . The final protein concentrate was resuspended in 
400 m1 PBS buffer. Protein purity was analyzed by SDS-PAGE. 
35 A single prominent band was observed. 
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Example 15 

Preparation of An-H-doo-i-i fn ^ s tn tfhn^ 
The 409-1-1 (c-a) digest fragments from lambda gtll 
were released by EcoRI digestion of the phage, and the "A" 
5 region purified by gel electrophoresis. The purified frag- 
ment was introduced into the pGEX expression vector 
(Smith). Expression of glutathione S-transf erase fused 
protein (Sj26 fused protein) containing the 409-1-1 (a) 
peptide antigen was achieved in E. coli strain KM392 
10 (above) .. The fusion protein was isolated from lysed 
bacteria, and isolated by affinity chromatography on a 
column packed with glutathione-conjugated beads, according 
to published methods (Smith). 

The purified Sj26/409-l-i (a) fused protein was 
injected subcutaneously in Freund's adjuvant in a rabbit. 
Approximately l mg of fused protein was injected at days 0 
and 21, and rabbit serum was collected on days 42 and 56. 



A purified Sj 26/5-1-1 fused protein -was similarly 

prepared using the an amplified HCV fragment encoding the 

5-1-1 fragment. The fused Sj26/5-l-l protein was used to- 

immunize a second rabbit, following the same immunization 

schedule. A third rabbit was similarly immunized with 

purified sj 26 protein obtained from .control bacterial 
25 lysate. 

Minilysates from the following bacterial cultures were 
prepared as described in Example 12: . (1) KM392 cells 
infected with pGEX, pGEX containing the 5-1-1 insert, and 
pGEX containing the 409-i-i(a) insert; and (2) BNN103 
infected with lambda gtll. containing the 5-1-1 insert and 
gtll containing the 409-1-1 (c-a) insert. The minilysates 
were fractionated by SDS-PAGE, and the bands transferred to 
nitrocellulose filters for Western blotting as described in 
Example 12. Table 12 shows the pattern of immunoreaction 
35 which was observed when the five lysate preparations (con- 
taining the antigens -shown at -the -left ~in-the table) were 
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screened with each of the three rabbit immune sera. Summa- 
rizing the results, serum from control (Sj26) rabbits was 
immunoreactive with each of the Sj26 and Sj26 fused protein 
antigens. Serum from the animal immunized with Sj26/5-i-i 
fused protein was reactive with all three Sj-26 antigens 
and with the beta-gal/ 5-1-1 fusion protein, indicating the 
presence of specific immunoreaction with the 5-1-1 antigen. 
Serum from the animal immunized with Sj26/409-l-l (a) fused 
protein was reactive with all three Sj-26 antigens and with 
the beta-gal/409-l-l(c-a) fusion protein, indicating the 
presence of specific immunoreaction with the 409-1-1 (a) 
antigen. None, of the sera were immunoreactive with beta- 
galactosidase (obtained from a commercial source). 



15 



20 



25 



30 



35 



Antigens 



Sj2 6 



Table 12 



Sj26 



Sj26 

5-1-1/ 
(Sj26) 

5-1-1/ 

OS-bal) 

409-1-1 (a) 
(Sj26) 

409-l-l(c-a) 

tf-gai) 



Antibody 



5-l-l/Sj26 



409-l-l(a) /- 



40 



Anti-409-l-i(a) antibody present in the sera from the 
animal immunized with the Sj26/409-l-l (a) is purified by 
affinity chromatography, following the general procedures 
described in Example 12, but where the ligand derivatized 
to the Sepharose beads is the purified beta-gal/409-l-l (c- 
a) fusion protein, rather than the anti-beta-galactosidase 
antibody. — — — — ■ •- - - -- ••; -' 
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Example 16 

Cloning the HCV Cap s id Pro^in coding s,. T i«n™»e 
The example describes the cloning of HCV coding 
sequences which encodes the N-terminal region of the HCV 
5 capsid protein. . 

The protein sequence of the HCV-capsid associated 
antigen corresponds to the nucleotide residues 325-970 of 
the full length HCV sequence (see Appendix A). The 
following sequences were used as PCR primers to clone this 
10 region: SF2(C), 5' end starting at nucleotide 325 of the 
full length HCV sequence (Appendix), 5 ' -GCGCCCATGGGCACG- 
ATTCCCAAACCTCA; and SRI (C) ,3' end starting at nucleotide 
969. of the full length HCV sequence (Appendix), 5'-GCCGG- 

ATCCCTATTACTC (G/A) TACACAAT(A/G)>CT(C/T) GAGTT (A/G) G. The 

anticipated size of the fragment generated using the 
SF2(C)/SR1(C) primer pair was 644 base pairs. 

SISPA-amplified cDNA fragments from Example 7 were 
mixed with 100 nl Buffer A, 1 „m of equal molar amounts of 
each SR2 and SF1 primer given above, 200 jM each of dATP, 
20 dCTP, dGTP, and dTTP , and 2.5 units of Thermus aquati^l 
DNA polymerase (Taq polymerase) , as in Example 8. 

Specific amplification of the SISPA-amplified cDNA 
fragments with -the capsid primer pair . given above was 
. ..carried out under conditions similar to those described in 
25 Example 7, with 1 minute at 72°C and about 30 cycles. 

The amplified fragment mixtures from above were each 
fractionated by agarose gel electrophoresis on duplicate 
1.2% agarose gels, and one of the gels transferred to 

i 

nitrocellulose filters (Southern) for hybridization with 
30 with a radioactively labelled oligonucleotide (Southern) 
having the following sequence: SF3 (M/E) , 5' end starting 
at nucleotide 792 of the full length HCV sequence (Appen- 
dix) , 5'-GCGCCCATGGTTCTGGAAGACGGCGTG. This oligonucleotide 

C °5 reSp0ndS t0 a s^ 61 ^ internal to the amplification 
35 product generated by using the SF2(C) and SRl(C) primers. 
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Eight out of 15 PCR products were identified which gave a 
positive hybridization signal with the internal probe. 

The vectors pGEX (Example 15) and pET (NOVAGEN, 565 
Science Drive, Madison, WI 53711) were chosen for bacterial 
expression of protein sequences encoded by the inserts. 
The pGEX vector provided expression of the inserted coding 
sequences as fusion proteins to Sj 26 (see Examples 12 and 
15) and the pET vector provided expression of the cloned 
sequences alone. To clone the capsid sequences, the 
amplification product bands were excised from the duplicate 
gel. The DNA was extracted from the agarose and doubly- 
digested with Ncol and BamHl. .A pGEX vector containing the 
BamHI/NcoI cloning sites was also doubly digested with 
BamHI and Ncol. The vector and extracted DNA were then 
ligated under standard conditions and the ligation mixture 
transformed into bacterial cells. 

The bacterial transformants were cultured under 
ampicillin selection, and the plasmid DNA isolated by 
alkaline lysis (Maniatis et a 1. ). The isolated plasmid DNA 
was digested with Ncol and BamHI. The digestion products 
were then electrophoretically separated on an agarose gel. 
The gel was transferred to nitrocellulose and probed with 
radioactively labelled SF3 as above. Twelve clones were 
confirmed to have the insert of interest by the Southern 
25 blot analysis. 

Clones were generated in the pET vector in essentially 
the same manner. 
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Example 17 

Immunological sto ning pf j-^e putative wry 

Capsid Prote-in n nnoe 
This example describes the immunological screening of 
the putative HCV capsid protein clones which were obtained 
in Example 18. 

Of the twelve clones obtained in Example 16, protein 
mini-lysates of 7 clones (clones # 8, 14, 15, 56, 60, 65, 
and 66) were prepared as described in Example 12. These 
mini-lysates were fractionated as described and transferred 
to nitrocellulose for Western Blot analysis. Table 13 
shows the pattern of immunoreaction which was observed when 
the 7 lysate preparations were screened with the indicated 



Table 13 




The serum samples used for screening were identified 
as follows: SKF, HCV negative; FEC, HCV positive; BV, 
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community acquired HCV; A6 and B9 correspond to human 
clinical serum samples which were HCV positive. 

Immunoreactive bands identified on the Western blot 
were all smaller than the expected size 'of 50 kd (based on 
5 the predicted coding sequence of the cloned inserts , see 
below) . . 

Clone 15 was chosen for scale-up production of the. 
Sj26 fusion protein (Smith et al. ) . . A one liter prepara- 
tion of clone 15 yielded about 200 fig of purified immunore- 

10 active material. The bulk of the immunoreactive material 
appeared in a major doublet band which ran at approximately 
29 kd. The yield from this preparation was unexpectedly 
low: typically with ' the pGEX system a one liter protein 
preparation yields in the range of 50-100 mg fusion 

15 protein. 

Example 18 

Nucleic Acid Se quences of Clones 15 and 56 
The inserts of clones 15 and 56 (discussed in Example 
20 17) were sequenced as per the manufacturer's instructions 
(US Biochemical Corporation, Cleveland OH), using the 
dideoxy chain termination technique (Sanger, 1979). Each 
of the clones had an open reading frame contiguous with the 
Sj26 reading frame of the pGEX vector. The sequences of 
25 the clone inserts were near identical with only a few minor 
sequence variations: the sequence of clone 15 had a 
termination codon starting at nucleotide position 126. The 
sequence data for clone 56 is presented as SEQ ID NO: 11 and 
in Figure 8A. 

30 . The sequencing of the inserts revealed the unusual 

feature of a run of adenine residues from nucleotide 
position 25 to position 34 (Figure 8A) : such sequences are 
similar to sequences known to promote translation frame- 
shifting (Wilson et al., Atkins et, al. ) . The open reading 

35 frame contiguous with the Sj26 coding sequence" predicts a 
protein- of approximately 23 .5 kd. Accordingly," given the 
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approximately 26 kd size of the. Sj26 "protein fragment in 
this construct (Smith etal.), the complete fusion protein 
would be predicted to be approximately 50 kd. 

Example ^ Q - 

Hydropathicity Plot of the Pr-n» a in Bncodftd 'bv Clone 

The SOAP program from IntelliGenetics PC/ GENE™ soft- 
ware package was used to generate the hydropathicity plot 
of Figure 9, The SOAP program uses the method of Kyte et 
al. to plot the hydropathicity of the protein along its se- 
quence. The interval used f or the computation was 11 amino 
acids . In Figure 9 , the hydrophobic side of the plot 
corresponds to the positive values range and the hydrophil- 
ic side to the negative values range. 

The hydopathicity plot indicates (i) the hydrophilic 
nature of the amino terminus of the capsid protein, (ii) 
the relatively hydrophobic nature of the region of amino 
acid residues approximately 122 to 162, and (iii) the 
hydrophobic nature of amino acid residues approximately 
20 . 168-182. 

Further, the region of amino acid residues 168-182 
demonstrates potential for being a membrane spanning 
segment (Klein et al.). 



Example 20 

Deletion Analysis of the CI^p 56 Protein Coding ReeHnn 
This example describes the generation of a series of 
car boxy and amino terminal deletions of the HCV capsid pro- 
tein and the effect of these deletions on the immunoreacti- 
30 vity of the resulting proteins. 

A. Carboxy Terminal Deletions of clone 56. 
As one step to improve the expression of the HCV 
capsid protein, the putative region of translational 
frameshifting was modified to reduce the probability of a 
35 frameshift occurring in this region. In each AAA codon, 
^.encoding -lysine, (nucleotide positions-25 to 33, Figure 8A) 
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the third nucleotide in each codon (positions 27, 30 and 
33, Figure 8A) was changed from A to G using standard PCR 
mismatch techniques (Ausubel et al. , Mullis, Mullis et 
al.)- The sites of these substitutions are indicated in 
Figure 8 A by the three G ' s placed over the corresponding 
A's. The sequence of the modified pGEX clone was con- 
firmed as in Example 19 and the clone was named pGEX-CapA. 
The insert sequence of clone pGEX-CapA is shown in Figure^ 
8B and presented as SEQ ID NO: 13. 

The deletion clones were generated using the PCR 
primers given in Table 14 . In Table 14 the BamHT site is 
italicized and the termination codon is underlined. 

Table 14 

'. CARBOXY TERMINAL DELETION PRIMERS 



1. CI 5'-CGA TCC ATG GGC ACG AAT CCT AAA CC 

2. NC580 5'-G GCC GGA TCC TTA GGC CGA AGC GGG CAC AG 

3. NC520 5'-G GCC GGA TCC TTA ACC AGG AAG GTT CCC TGT TGC 

4. NC450 5'-G GCC GGA TCC TTA GGC CCT. GGC ACG GCC TCC 

20 5* NC360 5'-G GCC GGA TCC TTA CAA ATT GCG CGA CCT ACG CC 

6. NC270 5'-G GCC GGA TCC TTA GCC CTC ATT GCC ATA GAG 

Amplification reactions were carried out essentially 
as described in Example 16 .using primer CI paired with each 
25 of the NC primers , and purified plasmid pGEX-CapA as 
template: the amplification reaction was 1 minute at 95 ° / 
annealed 2 minutes at 50° and 3 minutes at 72° for 20 
cycles. . 

The following sequence comparisons are given relative 
30 to the nucleic acid sequence presented in Figure 8B, The 
CI. primer corresponds to the common 5* end of the pGEX-CapA 
insert which contains an Ncol site near the initiating 
methionine. The sequence of the NC primers each start at 
the nucleotide position indicated, for example, the homolo- 
35 gous sequence of the NC580 primer ends at nucleotide posi- 
— ^tion 580. A termination ^codon^is ^inserted at that posi- 
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tion, following a BamSI site. The positions of the primers 
given in Table 14 are indicated in Figure SB. The approxi- 
mate locations of the primers relative to the protein 
sequence are indicated in Figure 9. 

The resulting amplification products were electrophor- 
etically size fractionated on a polyacrylamide gel and the 
DNA products of the appropriate sizes electroeluted from 
the gel. The amplification products were cloned into both 
the pGEX and the pET vectors for expression. The sequences 
of the inserts were confirmed as described in Example 18. 

/The pGEX vectors containing the carboxy-terminal dele- 
tions were transformed into E. coli and the. fusion proteins 
purified essentially as follows. Expression of the fusion 
protein was induced with IPTG for 3-4 hours. The cells 
were then harvested at 6,000 rpm for 10 minutes ... The E. 
coll were then lysed in MTPBS buffer (150 mM NaCl; 16 mM 
N a2 HP0 4 ; 4 mM NaH 2 P0 4 , pH=8.0) after which 1% "TRITON X-100," 
3 ,/g/ml DNase I, and 1 mM PMSF were added. The lysates were 
centrifuged at 15,000 rpm for 20 minutes. The supernatants 
were discarded and the pellets resuspended in 8M. urea. The 
components of the resuspenion were separated by HPLC using, 
a "BIO-GEL SP-5-PW" column. Typically, the fusion protein 
was.±he predominant peak: the location of the fusion pro- 
tein was confirmed by Western Blot analysis. Clones 
C1NC270, C1NC360, and C1NC450 all expressed Sj2 6 fusion 
proteins at high levels: the fusion proteins all corre- 
sponded to the size predicted from the insert coding se- 
quence .fused to the Sj26 protein and were immunoreactive, 
with HCV-positive sera (Western Blots were performed as 
described in Example 17). Although the supernatants were 
discarded substantial amounts of the fusion proteins were 
also present in the supernatants. Clones C1NC520 and 
C1NC580 gave poor yeilds of fusion proteins. 

An epitope map of the HCV capsid region is presented - 
in Figure 10: the location of the immunoreactive protein 
coding sequences corresponding-to inserts ClNC45d,"ciNC360r" 
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and C1NC270 are indicated. The sequences of C1NC450, 
C1NC360, and C1NC270 are presented in the Sequence Listing 
as SEQ ID NO: 15, SEQ ID NO: 17, and SEQ ID NO: 19, respec- 
tively. 

B. Amino Terminal Deletions of Clone 56. 
Amino terminal deletion clones were generated using 
the PCR primers given in Table 15. 



10 



15 



1. 

2. 
3. 



20 



Table 15 ■ 
AMINO TERMINAL DELETION PRIMERS 
C100 GAG CCC ATG GGT GGA GTT TAC TTG TTG CC ' 
C270 GAG CCC ATG GGC TGC GGG TGG GCG GG 
C360 GAG CCC ATG GGT AAG GTC ATC GAT ACC 

Amplification reactions were carried out essentially 
as described above using the primer pairs . presented in 
Table 16 and purified plasmid pGEX-CapA as template: the 
amplification reaction included was 1 minute at 95° , 
annealed 2 minutes at 50°, and 3 minutes at 72° for 20 
cycles. 



Table 16 



25 



30 



35 



40 



NH 2 Primer ; COOH Primer Protein Produced? Immunoreactive? 



C100 



C270 



C360 



NC450 
NC360 
NC270 

NC450 
NC360 

NC450 



LOW 
YES 
YES 

YES 
YES 

YES 



YES 
YES 
YES 

NO 
NO 

NO 



The following sequence comparison are given relative 
to the nucleic acid sequence presented in Figure 8B where 
the above described A to G substitutions have been made for 
the sequence of pGEX-CapA. The NC660 primer corresponds to 



the common 3 ' end of the pGEX-CapA' insert which contains a 



WO 91/15516 PCT/US91/02370 



•84 

BamHI site near the end of the insert. The sequence of the 
C primers each start at the nucleotide position indicated, 
for example, the sequence of the NC100 primer begins at 
nucleotide position 100. Each of the C primers introduces 
5 an in-frame initiation codon in the resulting amplification 
product. The positions of the primers given in Table 15 
are indicated in Figure 8B. 

The resulting amplification products were cloned into 
the pGEX and pET vector for expression as described above. 

10 -The sequences of the inserts were confirmed. 

The pGEX vectors containing the carboxy- terminal 
deletions were transformed into E. coli, protein mini - 
lysates prepared, and the immunoreactivity of the proteins 
analyzed by Western Blots as described above. The results 

15 of the analysis are presented in Table 16. Clones 
C100NC270 and C100NC3 60 expressed Sj 26 fusion proteins at 
high levels: the fusion proteins corresponded to the size 
predicted from the insert coding sequence fused to the Sj26 
protein. 

20 An epitope map of the HCV capsid region is presented 

in Figure 10: the location of the protein coding sequences 
corresponding to inserts C100NC270, C100NC360, C270NC360, 
and C270NC450 are indicated. The sequences for C100NC270 

. and C100NC360 are presented in the Sequence Listing as SEQ 

25 id NO: 21 and SEQ ID NO: 23, respectively. 

Example 21 

Expanded Immunoscr^ enina Using the Cansid Antiaen 
This example describes three different comparisons of 
30 the immunoreactivity of the various HCV antigens of the 
present invention to several battery of sera . 
A. Effectiveness of Cap450 Antigen. 
Table 17 shows the results of 50 human sera samples 
from patients suspected of NANB hepatitis infection. The 
35 ELISA assays were performed essentially as described by 
___„.Tijssen using the following 3 antigens: C100, 409-1-1 (c-. 
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a) , C33u, Cap450 (the protein product of the pGEX-ClNC450 
clone) , and with 409-1-1 (c-a) and cap4150 in one well which 
was optimized to give the most sensitive results. These 
ELISA data were compared with the Abbott C100 test. 

Patient serum was scored positive for Sj26 fusion pro- 
teins (409-1-1 ca, 33u, 5-1-1, and Cap450) if the absor- 
bance was three times the absorbance of that serum on Sj26 
native protein. A sample was scored positive on pET anti- 
gens (cap360) if the absorbance was three times the mean of 
the absorbance .of negative control sera. A patient serum 
was scored positive on the combined 409-1-1 ca/cap450 assay 
if the absorbance was equivalent or greater than that of 
control positive sera. Samples within 10% of the control 
positive sera were scored weak positives. 
[Samples 1-19: Chronic active hepatitis proven by biopsy. 
HBS Ag(-) . 

Samples 20-44: Acute viral hepititis HBsAg(l), ISM Anti- 
HBC(-), IgM anti-HAV(-) . 

Samples 45-50: Chronic active hepatitis proven by biopsy. 
HBsAg(-) . 



Table 17 



25 



Korean Panel II 



30 



35 



1 
2 
3 
4 
5 
6 
7 
8 



Sampli 



C100 

+ 
+ 
+ 
+ 



+ 




C33u 

+ 
+ 
+ 
+ 
+ 
+ 
+ ' 



Cap 
450 

+ 
+ 
+ 

+ 
+ 
+ 



Combined 
409-1-1 (c-a) 
+CAP450 



+ 

+ 
+ 

+ 
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35 




. + 
w+ 



* = 



positive (low) 



The results demonstrate that the Cap4 50 protein has 
good sensitivity for detecting the presence of anti-HCV an- 
tibodies in sera samples. Three additional samples (6, 37, 
and 47) were detected. Further, these results indicate 
that the combination of Cap450 and 409-1-1 (c-a) can be used 
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to produce a kit which is very effective for detection of 
anti-HCV antibodies in human sera samples. 



10 



B. Cap450 and Cap360. 

The results in Table 18 demonstrate the effectiveness 
of the Cap450 and Cap360 antigen (the protein product 
encoded by of P ET-C1NC360) to detect HCV antibodies present 
in human sera. The samples were tested for the presence of 
HCV by. ELISA using each individual antigen shown, or with 
409-1-1 (c-a) and Cap450 antigens combined in one well. 
' ' • Table 18 



15 



20 



25 



30 



SERUM 


PATIENT DIAGNOSIS 


C100 


ELISA 
5-1-1 


409-1-1 
(c-a) 


C33u 


Cap 
360 


Comb ined 
(409-1-1) 
+Cap450 


G-131 


Acute Hepatitis; Pt "CO." 


■ — r 


— 


— 


— 


— 


- 


G-132 


Acute Hepatitis; Pt "CO." 














G-143 


Acute Hepatitis; Pt "CO." 














G-285 


Acute Hepatitis; Pt "CO." 


ND 


ND 


ND 


ND 


ND 




G-150 


Acute P.T. Hepatitis; Pt 
"G.L. " 






I 


I 


+ 


+ 


G-151 


Acute P. T. . Hepatitis; Pt 
"G.L. " 






I 






+ 


G-152 


Acute P.T. Hepatitis; Pt 
"G.L. " 










+ 




G-153 


Acute P.T. Hepatitis; Pt 
"G.L. " 






I- 




+ 


+■ . 


G-286 


Acute P.T. Hepatitis; Pt 
"G.L. " 


ND 


ND 


ND 


ND 


ND 


+ 


G-43 


Fulminant Liver Disease 














G-l 


Community Acquired 
Hepatitis 


ND 


I 


+ 




+ 


+ 


G-109 


Community Acquired 
Hepatitis 


+ 




+ 


+ t 


+ 


+ 


G-114 


Community Acquired 
Hepatitis 


ND 












G-128 


Community Acquired 
Hepatitis 


+ 




I > 


+ 


+ ' 


+ 


G-3 


Community Acquired 
Hepatitis 
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SERUM 



PATIENT DIAGNOSIS 



G-12 6 {Community Acquired 
epatitis 



C100 



ELISA 
5-1-1 



409-1-1 




Cap 


<c-a) 


C33u 


360 



G-12 7 (Community Acquired 
epatitis 



Combined 
(409-1-1) 
+Cap450 



r-42 (idiopath. Comm.. Ac. 
[epatitis 



r-51 [Community Acquired 
[epatitis B 



r-2 7 [community Ac quired 
[epatitis B 



(G-22 " [Community Acquired 
[epatitis B 



r-40 (Community Acquired 
[epatitis B 



r-31 [Community Acquired 
[epatitis B 



r-45 (community Acquired 
[epatitis B 



1-38 [Fulminant Hepatitis B 



r-41 [Community Acquired 
[epatitis C 



r-13 (Hepatitis C 



r-12 hepatitis C 



r-6 (Hepatitis C 



r-49 EtOH Cirrhosis 



r-2 5 EtOH Cirrhosis 



r-110EtOH Cirrhosis 



r-46 (EtOH Cirrhosis 



(G-272 (infant Liver Transplant 



ND 



r-2 74 Infant Liver Transplant 



ND 



r~16 



r-123 



r-122 



r-125 



PBC 



INC LT 



INC LT 



No Diagnosis 



-124 No Diagnosis 



These results indicate that the combination of antigen 
409-1-1 (c-a)~~a[nd~C^ i n a effective 
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diagnostic tool for detection of HCV infection. Five 
additional samples (Gl50,G15l, Glio, G125, and G124) were 
detected with these ELISA's compared with C100 test alone. 

C. pET360 

The results in Table 19 demonstrate the effectiveness 
of the pET360 to detect HCV antibodies present in human 
sera. The samples were tested for the presence of HCV by 
ELISA using each individual antigen shown, or with 409-1-1 
(c-a) and pET360 antigens combined in one well. 

.' Table 19 







C100 


5-1-1 


409-1-1 


I 

C33u 


- 

pET360 


Combined 
409-1-1 (c-a) 
+ pET360 




A 


+ 




+ 


+ 




+ 


15 "-. . . | 


B 




■ + 


+ 


+ 








C 








+ 








D 


+ 




+ 




+ 


+ 






+ 




w+ 


+ 


+ 


+ 




F 














20 


G 


+ 




w+ 




+ 


• . + 




H 






■ + 




+ 


+ 




I* 
















J 
















K 








• + 


+ 


+ ' 


25 


L 












* 




M 
















N - 




w+ 


* * 


+ 


+ 


+ 




O 


- + 


w+ 


+ 


+ 


+ 






P 


. + 


w+ 


+ 


. + 




+ 


30 


Q 














R 














I s 















35 



These results indicate that the combination of antigen 409-1-1 te- 
al .arid pET360 result in a effective diagnostic tool for detection of 
HC Y infectio "' Three additi onal samples were detected_with__these + 
ELISA's compared with C100 test alone. 
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Although the invention has been described with reference to 
particular embodiments, methods, construction and use, it will be 
apparent to those skilled in the art that various changes and modifica- 
tions can be made without departing from the invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: - 

(i) APPLICANT: Reyes, Gregory 

Kim, Jungsuh P. 
Moeckli, Randolph 
Simonsen, Christian C. 

(ii) TITLE; OF INVENTION: Hepatitis C Virus Epitopes 

(iii) NUMBER OF SEQUENCES: 26 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Peter J. Dehlinger 

(B) STREET: 350 Cambridge Ave., Suite 100 

(C) CITY: Palo Alto 

(D) STATE : CA 

(E) COUNTRY: USA 

(F) ZIP: 94306 

(v) COMPUTER READABLE FORM: 
. (A) MEDIUM TYPE: Floppy disk 
, (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 505,611 

(B) FILING DATE: 06-APR-1990 

(vii) PRIOR APPLICATION DATA: * 

(A) APPLICATION NUMBER: US 594,854 

(B) FILING DATE: 09-OCT-1990 

(viii) ATTORNEY/ AGENT INFORMATION: 
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(A) NAME: Fabian, Gary R. 

(B) REGISTRATION NUMBER: 33,875 

(C) REFERENCE /DOCKET NUMBER: 4600-076.21 

<ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (415) 323-8302 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 561 base . pairs . 
\ . (B) TYPE : nucleic acid 

.(C) STRANDEDNESS : double , 
(D) TOPOLOGY: linear 

. T . f — _ . * - ■ 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: .-' ' 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: 304-12-1 

(ix) FEATURE: 

- / (A) NAME /KEY: CDS 

(B) LOCATION: 1.. 561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAA TTC CTC GTG CAA GCG TGG AAG TCC AAG AAA ACC CCA ATG GGG TTC 48 
Glu Phe Leu Val Gin Ala Trp Lys Ser Lys Lys Thr Pro Met Giy Phe 
1 5 10 15 

. TCG TAT GAT ACC CGC TGC TTT GAC TCC ACA GTC ACT GAG AGC GAC ATC 96 
Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp lie 

' : " ^ r —™-20 " ' ■ — * ~ 25' v \' ' 30 - ^ 



CGT ACG GAG GAG GCA ATC TAC CAA TGT TGT GAC CTC GAC CCC CAA GCC 
Arg Thr Glu Glu Ala lie Tyr Gin Cys Cys Asp Leu Asp Pro Gin Ala 
35 40 45 

CGC GTG GCC ATC AAG TCC CTC ACC GAG AGG CTT TAT GTT GGG GGC CCT 
Arg Val Ala lie Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro 
50 - 55 60 

CTT ACC AAT TCA AGG GGG GAG AAC TGC GGC TAT CGC AGG TGC CGC GCG 
Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg Ala 

65 70 . ■ . 75 •. 80 

AGC GGC GTA CTG ACA ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC 
Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr lie 

85 90 95 

* - * ■ 

AAG GCC CGG GCA GCC TGT CGA GCC GCA GGG CTC CAG GAC TGC ACC ATG 
Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gin Asp Cys Thr Met , 

100 105 no 

CTC GTG TGT GGC GAC GAC TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC 
Leu Val Cys Gly Asp Asp Leu Val Val He Cys Glu Ser Ala Gly , Val 
115 120 / 125 ; " 

■ i ' ■ - '* ■ • j . . **- ■ ■ - 

CAG GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG GCT ATG ACC AGG 
Gin Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg 
130 135 140 

TAC TCC GCC CCC CCC GGG GAC CCC CCA CAA CCA GAA TAC GAC TTG GAG 
Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu 
145 150 155 160 

CTC ATA ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC GAC GGC GCT 
Leu He Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly Ala 

165 170 . 175 

GGA AAG AGG GTC TAC TAC CTC ACC CGG GAA TTC 
Gly Lys Arg Val Tyr Tyr Leu Thr Arg Glu Phe 

180 185 



(2) INFORMATION FOR SEQ ID NO: 2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY:, linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Glu Phe Leu Val Gin Ala Trp Lys Ser Lys Lys Thr Pro Met Gly Phe 
1 5 10 15 

Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp lie 

20 25 . 30 

Arg Thr Glu Glu Ala lie Tyr Gin Cys Cys Asp Leu Asp Pro Gin Ala * 
35 40 45 

Arg Val Ala lie Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro 
50 55 60 

Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr Arg Arg Cys Arg Ala 
65 , 70 75 80 

Ser ' Gly Val Leu Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr lie 

85 90 95 

Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu Gin Asp Cys Thr Met 

100/ 105 no 

Leu Val Cys Gly Asp Asp Leu Val Val lie. Cys Glu Ser Ala Gly Val 
115 120 125 

Gin Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg 
130 135 140 

Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu 
145 150 155 160 

Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val Ala His Asp Gly Ala 

165 170 175 

Gly Lys Arg Val Tyr Tyr Leu Thr Arg Glu Phe 

180 * - 185 ~ : — ; — - -. 
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(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS:. 
. . (A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iy) ANTI-SENSE: NO 

( vi ) ORIGINAL SOURCE : 

(A) ORGANISM: Hepatitis HCV Virus 

(B) , STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 303-1-4 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .252 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:3: 

AAC TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA ACA CCA ATA GAC 48 
Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr Pro He Asp 
1 5 * . 10 15 

ACT ACC ATC ATG GCT AAG AAC GAG GTT TTC TGC GTT CAG CCT GAG AAG 96 
Thr Thr lie Met Ala Lys Asn Glu Val Phe Cys Val Gin Pro Glu Lys 

20 25 .30 

GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC GAT CTG GGC GTG 144 
Gly Gly Arg Lys Pro Ala Arg Leu , He Val Phe Pro Asp Leu Gly Val 
35 40 * 45 

CGC GTG TGC GAA AAG ATG GCT TTG TAC GAC GTG GTT ACC AAG CTC CCC 192 
Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr Lys Leu Pro ~ 

50. • 55 . • 60 • 
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TTG GCC GTG ATG GGA AGC TCC TAC GGA TTC CAA TAC TCA CCA GGA CAG 

Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin 

65 70 75 80 

CGG GTT GAA TTC 
Arg Val Glu Phe 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids. ' 

(B) TYPE: amino acid. •' ■ : 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
Asn Ser Vail Trp Lys Asp Leu Leu Glu Asp Asn Val Thr Pro lie Asp 

{I ■ • . ..... • \:- S . •. :■ .. -,.10 '. '. .15 

1 ■ t ' * r * 

Thr Thr lie Met Ala Lys Asn Glu Val Phe Cys Val Gin Pro Glu Lys 

' 20 ■ . < . - 25 - 30 

Gly Gly Arg Lys Pro Ala Arg Leu He Val Phe Pro Asp Leu Gly Val 

35 . . 40 ' 45 

Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr Lys Leu Pro 
50 / 55 60 

Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin 
65 70 75 80 

Arg Val Glu Phe 

(2) INFORMATION FOR SEQ ID NO: 5: 

SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 1512 base pairs : 
~ — (B)-TYPE:-nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE:, 

(A) ORGANISM: Hepatitis C Virus 

(vii) IMMEDIATE SOURCE : , " ■'' 

(B) CLONE: 303-1-4 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1512 



(aci) SEQUENCE DESCRIPTION: SEQ ID NO:S: 

GAA TTC TTC ACA GAA TTG GAC GGG GTG CGC CTA CAT AGG TTT GCG CCC ' 48 
Glu Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro 

. 1 ' / . ■•; ' 5 ., . .{ io ' . is . 

CCC TGC AAG CCC TTG CTG CGG GAG GAG GTA TCA TTC AGA GTA GGA CTC 96 
Pro Cys Lys Pro Leu Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu 

20 25 . '-• • ; ' 30 

CAC GAA TAC CCG GTA GGG TCG CAA TTA CCT TGC GAG CCC GAA CCG GAT 144 
His Glu Tyr Pro Val Gly Ser Gin -Leu Pro Cys Glu Pro Glu Pro Asp. 
'35 ' 40 . • ' 45 

GTG GCC GTG TTG ACG TCC ATG CTC ACT GAT CCC TCC CAT ATA ACA GCA 192 
Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His He Thr Ala 
50 55 • 60 

GAG GCG GCC GGG CGA AGG TTG GCG AGG GGA TCA CCC CCC TCT GTG GCC 240 
Glu Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Val Ala 

65 70 75 , • . 80 



* 
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AGC TCC TCG GCT AGC CAG CTA TCC GCT CCA TCT CTC AAG GCA ACT TGC 
Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys 

85 90 95 ; 

ACC GCT AAC CAT GAC TCC CCT GAT GCT GAG CTC ATA GAG GCC AAC CTC 
Thr Ala Asn His, Asp Ser Pro Asp Ala Glu Leu lie Glu Ala Asn Leu 

100 * 105 110 . 

CTA TGG AGG CAG GAG ATG GGC GGC AAC ATC ACC AGG GTT GAG TCA GAA 
Leu Trp Arg Gin Glu Met Gly Gly Asn lie Thr Arg Val Glu Ser Glu 
115 120 125 

AAC AAA GTG GTG ATT CTG GAC TCC TTC GAT CCG CTT GTG GCG GAG GAG; 
Asn Lys Val Val lie Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu 
. 130 135 140 

GAC GAG CGG GAG ATC TCC GTA CCC GCA GAA ATC CTG CGG AAG TCT CGG 
Asp Glu Arg Glu lie Ser Val Pro Ala Glu lie Leu Arg Lys Ser Arg 
145 150 155 160' 

AGA TTC GCC CAG GCC CTG CCC GTT TGG GCG CGG CCG GAC TAT AAC CCC 
Arg Phe Ala Gin Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro 

165 170 175 

CCG CTA GTG GAG ACG TGG AAA AAG CCC GAC TAC GAA CCA CCT GTG GTC 
Pro Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val 

180 • 185 190 

CAT GGC TGT CCG CTT CCA CCT CCA AAG TCC CCT CCT GTG CCT CCG CCT 
His Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro 
195 200 205 

CGG AAG AAG CGG ACG GTG GTC CTC ACT GAA TCA ACC CTA TCT ACT GCC 
Arg Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala 
210 215 220 

TTG GCC GAG CTC GCC ACC* AGA AGC TTT GGC AGC TCC TCA ACT TCC GGC 
Leu Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly 
225 230 235 240 

ATT ACG GGC GAC AAT ACG ACA AGA TCC TCT GAG CCC GCC CCT TCT GGC 

lie Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly 
' • ' ' ^245 ~ 250 " — ~ , '255 
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864 



912 



960 



100 

TGC CCC CCC GAC TCC GAC OCT GAG TCC TAT TCC TCC ATG CCC CCC CTG 
Cys Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu 

260 265 270 

GAG GGG GAG CCT GGG GAT.CCG GAT CTT AGC GAC GGG TCA TGG TCA ACG 
Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr 
275 280 285 

GTC AGT AGT GAG GCC AAC. GCG GAG . GAT GTC GTG TGC TGC TCA ATG TCT 
Val Ser Ser Glu Ala Aen Ala Glu Asp Val Val Cys Cys Ser Met Ser 
290 295 300 " 

TAC TCT TGG ACA GGC GCA CTC GTC ACC CCG TGC GCC GCG GAA GAA CAG 
Tyr Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gin 
305 • 310 315 320 

AAA CTG CCC ATC AAT GCA CTA AGC AAC TCG TTG CTA CGT CAC CAC AAT • 1008 
Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn 

325 330 • 335 

TTG GTG TAT TCC ACC ACC TCA CGC AGT GCT TGC CAA AGG CAG AAG AAA 1056 

Leu Val Tyr . Ser Thr Thr Ser Arg ser Ala Cys Gin Arg Gin Lys Lys . 

340 345 350 

GTC ACA TTT GAC AGA CTG CAA GTT CTG GAC AGC CAT TAC CAG GAC GTA 1104 
Val Thr Phe Asp Arg Leu Gin Val Leu Asp Ser His Tyr Gin Asp Val 
355 360 365 

CTC AAG GAG GTT AAA GCA GCG GCG TCA AAA GTG AAG GCT AAC TTG CTA 1152 
Leu Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu 
.370 375 380 

TCC GTA GAG GAA GCT TGC AGC CTG ACG CCC CCA CAC TCA GCC AAA TCC 1200 
Ser Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser 
385 • 3 90 395 400 

AAG TTT GGT TAT GGG GCA AAA GAC GTC CGT TGC CAT GCC AGA AAG GCC 1248 
Lys Phe. Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala 

405 410 .415 

GTA ACC CAC ATC AAC TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA _ 1296 
Val Thr His He Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val" 

4 20 425 • ' 430 — ;r— L ■ 
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ACA CCA ATA GAC ACT ACC ATC ATG GCT AAG AAC GAG GTT TTC TGC GTT 
Thr Pro lie Asp Thr Thr lie Met Ala Lys Asn Glu Val Phe Cys Val 
435 440 445 



1344 



CAG CCT GAG AAG GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC 
Gin Pro Glu Lys Gly Gly Arg Lye Pro Ala Arg Leu He Val Phe Pro 
450 455 460 



1392 



GAT CTG GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG TAC GAC GTG GTT 
Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val 

465 „ i J.". 470 ; ... 475 480 



1440 



ACC AAG CTC CCC TTG GCC GTG ATG GGA AGC TCC TAC GGA TTC CAA TAG 
Thr , Lys Leu Pro Leu Ala Val Met Gly Ser 'Ser Tyr Gly Phe Gin Tyr 

485 ; 490 . 495 



1488 



TCA CCA GGA CAG CGG GTT GAA TTC 
Ser Pro Gly Gin Arg Val Glu Phe 

' ' 500 



1512 



(2) INFORMATION FOR SEQ ID NO:6: 

.(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Glu Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro 
1 5 10 15 

Pro Cys Lys Pro Leu Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu 

20 25 30 

His Glu Tyr Pro Val Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp 
35 40 45 



Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His He Thr Ala 

^ -~50 — — 55 -, ; en' — \ 
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Glu Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Val Ala 
65 70 75 SO 

Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys £la Thr CyB 

85 90 95 

Thr Ala Asn His Asp Ser Pro Asp Ala Glu Leu lie Glu Ala Asn Leu 

100 105 no 

Leu Trp Arg Gin Glu Met Gly Gly Asn lie Thr Arg Val Glu Ser Glu 
115 120 '. 125 

Asn Lys Val Val lie Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu 

130 135 - : 140 . ' 

Asp Glu Arg Glu lie Ser Val Pro Ala Glu lie Leu Arg Lys Ser Arg 
145 150 155 160 

Arg Phe Ala Gin Ala Leu Pro Vai Trp Ala Arg Pro Asp Tyr. Asn Pro 

165 170 175 

Pro Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val 

180 185 \; • 190 

His Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro 
195 200 205 

Arg Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala 
210 215 "220 

Leu Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly 
225 . . 230 235 240 

He Thr. Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly 

245 250 255 

Cys Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu 

260 265 270 

Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr 
275 . .280 .. ... : 285 
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Val Ser Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys Ser Met Ser 
290 295 .300 

Tyr Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gin 
305 310 315 320 

Lys Leu Pro lie Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn 

325 330 335 

Leu Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gin Arg Gin Lys Lys 

340 345 350 

Val Thr Phe Asp Arg Leu Gin Val Leu Asp Ser His Tyr Gin Asp Val 

355 ; 360. ■' • 365 

Leu Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu 
370 375 380 

Ser Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser 
385 390 395 400 

Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala 

405 4io 415 

Val Thr His lie Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val 

420 425 430 

Thr Pro lie Asp Thr Thr He Met Ala Lys Asn Glu Val Phe Cys Val 
435 " 440 445 

Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu lie Val Phe Pro 
450 455 460 

Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val 
465 . 470 475 480 

Thr Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr 

485 490 495 

Ser Pro Gly Gin Arg Val Glu Phe 

_ . 500 ' 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: . cDNA to mRNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) , ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(C) INDIVIDUAL ISOLATE: Rodney 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 409-1-1 (c-a) 

• (ix) FEATURE: . 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..477 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAA TTC CGC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGG GCG TAC ATG 48 
Glu Phe Arg Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met 
1 5 10 15 



96 



AAC ACT CCG GGG CTT CCC GTG TGC CAG GAC GGA ATT CCG TCC CCG TCC 
Asn Thr Pro Gly Leu Pro Val Cys Gin Asp Gly He Pro Ser Pro Ser ? 

20 25 30 



ACC ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTC GAA GTA 144 
Thr Thr Gly Glu He Pro Phe Tyr Gly Lys Ala He Pro Leu Glu Val 
35 . 40 45 

ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG TGC 192 
lie Lys Gly Gly Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys 

* * —50 — • - ,\ 55 * ~ : 60 " : — 
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GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG GCC 240 
Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly lie Asn Ala Val Ala 
65 70 75 80 

TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC GAT GTT 288 
Tyr Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val 

; 85 90 95 

GTC. GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC GGC GAC TTC 336 
Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe 

/.-.;.,_.-.., •. .. los no 

GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT TTC 384 
Asp Ser Val lie . Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe 
115 120 125 

AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC ACG CTC CCC CAG GAT 432 
Ser Leu Asp Pro Thr Phe Thr lie Glu Thr He Thr Leu Pro Gin Asp 
130 135 140 ' . 

GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG ACT GGC ACG GAA TTC 477 
Ala Val Ser Arg Thr Gin Arg Arg Gly Arg Thr Gly Thr Glu Phe 
145 150 _ / 155 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
' (A) LENGTH: 159 amino acids 

(B) TYPE: amino acid V 
<D). TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
Glu Phe Arg Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met 

. 1 ; 5 io is 

'1 . . „ 

Asn Thr Pro Gly Leu Pro Val Cys Gin Asp Gly lie Pro Ser Pro Ser 

20 25 30 
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Thr Thr Gly Glu He Pro Phe Tyr Gly Lys Ala lie Pro Leu Glu Val 

35 ' _ 40 ^ '. . • 45 



He Lys Gly Gly Arg His Leu lie Phe Cys His Ser Lys Lys Lys Cys 
50 55 



60 



Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly lie Ash Ala Val Ala 
65 70 



75 



80 



Tyr Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val 

: ■ '.' ■ ''. 85 '.. • : ' . - ,90 . . : _ 95 

Val Val Val Ala Thr Asp Ala. Leu Met Thr Gly Tyr Thr Gly Asp Phe 

• 100 .. • .[ , . l05 : - ; no ■ ; 

Asp Ser Val He Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe 

115 . . : 120 125 . • 

Ser Leu Asp Pro Thr Phe Thr lie Glu Thr lie Thr Leu Pro Gin Asp 
130 135 140 

Ala Val Ser Arg Thr' Gin Arg Arg Gly Arg" Thr Gly Thr Glu Phe 

145 . " . ' 150 . : 155 

(2) INFORMATION FOR SEQ ID NO: 9: • ," \ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs ' 
. ' (B) TYPE: nucleic acid : ' * ~ 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : ■ linear \ ' ^ 

(ii) MOLECULE TYPE : cDNA to mRNA 

. * . , ■■*[ . * ■ . ■ . ■ * ■■ ■ - . 

(iii) HYPOTHETICAL: NO . \ 1 \ 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Hepatitis C Virus 
'(B) STRAIN: CDC 



(vii) IMMEDIATE SOURCE: -~ ~ ™ 
(B) CLONE: 409-1-1 (abc) 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 
. (B) LOCATION : 1. .558 



r 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

TCC ACC. ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTC GAA 48 
Ser Thr Thr Gly Glu lie Pro Phe Tyr Gly Lys Ala lie Pro Leu Glu 

1 ■ ' 5 io i 5 

. GTA ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG - 96 

Val lie Lys Gly Gly Arg His Leu lie Phe Cys His Ser Lys Lys Lys : . ' 

20 . 25 . "- - ■ . 30 0 : 

TGC GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG 144 
Cys Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly lie Asn Ala; Val ' 

'■ ■ 35 40 -.45/ / 

GCC TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC GAT 192 
Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val lie Pro Thr Ser Gly Asp 

50 55 60 •-.;/';'-■'/.;■ 

GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC GGC GAC \ ; .240 
Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp 

65 ' '. ." ., 70 , . . 75 ''. ' 80 ■ '' ' 



TTC GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT ' . - .288 
Phe Asp Ser Val He Asp Cys Asn Thr Cys. Val Thr Gin Thr .Val Asp '. 

85 •' • - 90 .95 

TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC ACG CTC CCC CAG 336 
Phe ser Leu Asp Pro Thr Phe Thr He Glu Thr lie Thr Leu Pro Gin 

100 105 no 

GAT GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG ACT GGC AGG GGG AAG 384 
Asp Ala Val Ser Arg Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys 

; ■ us •■. 120 . , 125 

CCk GGC ATC TAC AGA TTT GTG GCA , CCG GGG GAG CGC CCC TCC , GGC ATG 432 
Pro Gly He Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met " 
.130 135 ; 140 
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TTC GAC TOG TCC GTC CTC TGT GAG TGC TAT GAC GCA GGC TGT GCT TGG 
Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp 
145 150 155 160 



165 170 

t 

AAC ACC CCG GGG CTT CCC GTG TGC CAG GAC 
Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 

180 185 



175 



(2) INFORMATION FOR SEQ ID NO:10: 

: (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

- (ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Thr Thr Gly Glu He Pro Phe Tyr Gly Lys Ala lie Pro Leu Glu 
• 1 5 • 10 15 

Val He Lys Gly Gly Arg His Leu He Phe Cys His Ser Lys Lys Lys 

20 25 30 

Cys Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly He Asn Ala Val 
35 40 . 45 

Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Ser Gly Asp 
50 55 v 60 

Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp 
65 7 0 75 80 

Phe Asp Ser Val lie Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp 

85 . 90 95 



480 



TAT GAG CTC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGA GCG TAC ATG 528 
Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met 



558 
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Phe Ser Leu Asp Pro Ihr Phe Thr lie Glu Thr lie Thr Leu Pro Gin 

100 ' 105 110 

Asp Ala Val Ser Arg Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys 
115 120 125 

Pro Gly lie Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met 
.130 135 ; 140 

Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp 
.145 . . 150 155 . 160 

Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met 

V 165 ■-■ 170 175 

Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 

180 185 

(2) INFORMATION FOR SEQ ID NO: 11: 

* (i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 657 base pairs 
: (B) TYPE: nucleic acid 
> : : (C) STRANDEDNESS : double . ; .. ; 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Hepatitis C Virus 
(6) STRAIN: CDC 

(Vii) IMMEDIATE SOURCE: 

(B) CLONE: GG1 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..657 



WO 91/15516 





PCT/US91/02370 



110 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 



ATG GGC ACG AAT CCT AAA CCT CAA AAA AAA AAC AAA CGT AAC ACC AAC 
Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 
1 5 10 15 



48 



CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 
Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 25 30 



96 



GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT. GTG CGC GCG 
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 



144 



ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 
Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

\ 50 ' V . 55 ; 60 



192 



ATC CCC AAG GCT 
lie Pro Lys Ala 
65 



CGT CGG 

Arg Arg 
70 



CCC GAG 
Pro Glu 



GGC AGG ACC TGG 
Gly Arg Thr Trp 

75 - 



GCT CAG 
Ala Gin 



CCC GGG 
Pro Gly 

.80 



240 



TAC CCT TGG CCC 
Tyr Pro Trp Pro 



CTC TAT 
Leu Tyr 

• 85 .' 



GGC AAT 
Gly Asn 



CTC CTG 
Leu Leu 



TCT CCC 
Ser Pro 
100 



CGT GGC 
Arg Gly 



TCT CGG 
Ser Arg 



GAG GGC TGC GGG 
Glu Gly Cys Gly 

. . 90 

CCT AGC TGG GGC 
Pro Ser Trp Gly 
105 



TGG GCG 
Trp Ala' 



CCC ACA 
Pro Thr 

> 110 



GGA TGG 
Gly Trp 

' 95 . 

GAC CCC 
Asp Pro 



288 



336 



CGG CGT 
Arg Arg 



AGG TCG 
Arg Ser 
115 



CGC AAT 
Arg Asn 



TTG GGT 
Leu Gly 
120 



AAG GTC ATC GAT 
Lys Val lie Asp 



ACC CTT 
Thr Leu 
125 



ACG TGC 
Thr Cys 



384 



GGC TTC 
Gly Phe 
130 



GCC GAC 
Ala Asp 



CTC ATG 
Leu Met 



GGG TAC 
Gly Tyr 
135 



ATA CCG CTC GTC 
He Pro Leu Val 

140 



GGC GCC 
Gly Ala 



CCT CTT 
Pro Leu 



432 



GGA GGC GCT GCC AGG GCC CTG GCG 
Gly Gly Ala Ala Arg Ala Leu Ala 



145 



150 



CAT GGC GTC CGG 
His Gly Val Arg 

155 • - 



GTT CTG 
Val Leu 



GAA GAC 
Glu Asp 
160 



480 
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GGC GTG AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCT TTC TCT ATC 528 
Gly Val Asn Tyr Ala Thr Gly Ash Leu Pro Gly Cys Ser Phe Ser He 

165 170 175 

■ ^ * 

TTC CTT CTG GCC CTG CTC TCT TGC TTG ACT GTG CCC GCT TCG GCC TAC 576 
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 

180 185 190 

CAA GTG CGC AAC TCC ACG GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT 624 
Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro 
195 200 205 

AAC TCG AGC ATT GTG TAC GAG TAA TAG GGA TCC 657 
Asn Ser Ser lie Val Tyr Glu * * Gly Ser 
210 215 



(2) INFORMATION FOR SEQ ID NO:12: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

• (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 

1 ' • - \ 5 10 " ; " ; ' 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 25 30 

Gly Val Tyr. Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 

• 35 , / < 40 45 . 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro ; Gly* 
,65 70 75 80 
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Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 

85 90 95 

t ... 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
.100 105 no . 

Arg Arg. Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr Cys 
US 120 125 

. Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
I 30 . ■ 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 . • • 150. ■ 155 ■ . . iso 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 

.. ' i65 170 . 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 

180 185 190 

Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys, Pro 
195 200 205 

Asn Ser Ser He Val Tyr Glu * * Gly Ser 

■'"/'' 210 " " . ' "• 215 ' 

(2) INFORMATION FOR SEQ ID NO: 13 : "~ "." s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

" (vi) ORIGINAL SOURCE: . ^ — : 

(A) ORGANISM: Hepatitis C Virus . ^. — . „ i _- 
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(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 
■(B) CLONE: CapA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..657 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

• <■•'*...- * < * . ■ . " 

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 48 
Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 
1 5 10 15 

. ■ - - * . * . ' ■ ' ... 

4 1 r ' . l ^ 

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 96 
Arg Arg Pro Gin Asp Val LyB Phe Pro Gly Gly Gly Gin He Val Gly 

20 25 30 

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 144 
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT " . ' . 192 
Thr Arg Lye Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 ' 55 60 ..... . 

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 240 
He Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 
65 70 75 80 

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 288 
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gl^ Cys Gly Trp Ala Gly Trp 

85 90 95 

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 336 
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 105 110 

CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC 384 
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 

~ 115 120 " 125 " ' ' ' ' ~ 
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6GC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT CTT 
Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
130 . 135 140 

GGA GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC 
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 . 150 . 155 . i 6 o 

GGC GTG AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCT TTC TCT ATC 
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 

165 170 ; 175 .. 1 

TTC CTT CTG GCC CTG CTC TCT TGC TTG ACT GTG CCC GCT TCG GCC TAC 
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 

180 las 190 

CAA GTG CGC AAC TCC ACG GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT 
Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr . Asn Asp Cys Pro 
195 200 205 

AAC TCG AGC ATT GTG TAC GAG TAA TAG GGA TCC 
Asn Ser Ser lie Val Tyr Glu * * Gly ser 
210 - ,215 



(2) INFORMATION FOR SEQ ID, NO:14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 
1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

. ^2.9^ 25 30 



432 

t 



480 



528 



576 



624 



657 
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Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
/ 35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

He Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 

85 90 '. — 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 . 105 no 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 

/ 165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 

180 185 190 

Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro 
195 200. 205 

Asn Ser Ser He Val Tyr Glu * * Gly Ser 
210 215 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 
____ _ (B) TYPE: nucleic acid ' " ' • -~ '■: - . 

(C) STRANDEDNESS : double 



(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE : NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE.: C1NC450 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1.. 453 



(xi) SEQUENCE DESCRIPTION: SEQ ID - NO: 15: 

AT6 GGC ACG AAT CCT AAA CCT CAG AAG AAG ARC AAA CGT AAC ACC AAC 

Met Gly Thr. Asn Pro Lys Pro Gin Lya Lys Asn Lys Arg Asn Thr Asn 
1 5 10 . . i 5 

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 
Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 _ 25 .; . 30 

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 
Thr Arg Lys Thr Ser Glu Arg. Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 ;•" 55 60 

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 
lie Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 

65 . 70 ' 75 ■■ 80 

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 

85 - -V--^ — 9Q- . : . 



WO 91/15516 PCIYUS91/02370 



117 

CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC * ' 336 
Leu Leii Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 105 no 

CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC 384 
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

GGC TTC GCC GAC CTC ATG GGG TAG ATA CCG CTC GTC GGC GCC CCT CTT 432 
Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
_ 130 135 140 

GGA GGC GCT GCC AGG GCC TA 453 
Gly Gly Ala Ala Arg Ala 
145 150 



(2) INFORMATION FOR SEQ ID NO: 16: 

:. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

>*'».■ • . .' ' ■ . ' - 

(ii) MOLECULE TYPE: protein . 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO:16: 
Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 

y 1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 . 60 

lie Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 

65 , ; ' * 70 75 80 
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Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 105 : no 



Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu . Thr Cys 
115 120 



125 



Gly Phe Ala Asp Leu Met Gly Tyr lie Pro Leu Val Gly Ala Pro Leu 
.. ,130 . ' . .. ' 135 



140 



Gly Gly Ala Ala Arg Ala 

145 ."v.- V 150 



(2) INFORMATION FOR SEQ ID NO; 17: 

( i ) SEQUENCE CHARACTERISTICS : 
■ (A) LENGTH: 360 base pairs 
. (B) TYPE: nucleic acid . 
(C) STRANDEDNESS : double 
- (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA , 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

'\ (A) ORGANISM: Hepatitis C Virus 
(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C1NC360 

(ix) FEATURE: 
• •• ' ' (A) NAME/KEY: CDS 

(B) LOCATION: 1.. 3 60 



( xi SEQUENCE-DESCRIPTION : SEQ ID NO: 17 
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ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 
Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 
• 1 5 10 15 



48 



CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 
Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 

20 25 30 



96 



GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 



144 



ACG AGA AAG ACT TCC GAG CGG .TCG CAA.CCT CGA GGT AGA CGT CAG CCT 
Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 . 60 



192 



ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG 
He Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 
65 70 75 80 



240 



TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG 
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 

85 90 ' " 95 : 



288 



CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC 
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 105 no 



336 



CGG CGT AGG TCG CGC AAT TTG . TA 

Arg Arg Arg Ser Arg Asn Leu 

115 120 



360 



.(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: pr tein — r 



(xi)-SEQUENCE DESCRIPTION: SEQ *ID ~NO:~I87 
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Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 

15 



1 .... 5 10 



Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin Tie Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 \ . 60 



Gly 



He Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro 
65 7 ° . 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp' Ala Gly Trp 

85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 



100 105 



110 



Arg Arg Arg Ser Arg Asn Leu 

115. - 



(2) INFORMATION FOR SEQ ID NO: 19: 

- (i) SEQUENCE CHARACTERISTICS: 

(A) , LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 
- (B) STRAIN: CDC-— — 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: C1NC2 70 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..273 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATG GGC ACG AAT CCT AAA CCT CAG AAG AAG AAC AAA CGT AAC ACC AAC 
Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT 
Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 25 30 

GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG CGC GCG 
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT 
Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG^CCC GGG 
He Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 
65 70 75 . 80 

TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TA 
Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly 

85 . • 90 



(2) INFORMATION FOR SEQ ID NO:20: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

; (ii) MOLECULE TYPE: pr tein 



48 



96 



144 



192 



240 



273 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
Met Gly Thr AsxxPro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 

1 ; : - 5 , . ; ;} 10 15 . 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 



25 



30 



Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
Thr Arg Lyslthr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

>.V 80 :'\. ^V'-V -V 55 '" . , : : :\\ 60 V',-' :. ; . 

' ' - . . . *! f J ' - ■ ■ . " . * - . ' ' ,' V ■* •, ■■ ■ ' ; . ' ' ' ," ' - ' - 

lie Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly 

65 .: - : ; 70 80 

Tyr p r o Trp Pro Leu Tyr Gly Asn Glu Gly 



85 



90 



( ) RMATION FOR SEQ ID NO:21: 

> ■ ' r ' ' * ; + " ' ■ ' * ' ' ' r- 

5 r ■ ^ J , * ^ " ' * 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 base pairs 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS: double 
CD) TOPOLOGY: linear 

r . { ii ) MOLECULE TYPE : ; cDNA to mRNA 

' (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO '; • ;\' . ' 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 



(vii) IMMEDIATE SOURCE: 

• (B) CLONE": C100NC270 



(.ix )_ FEATURE : 
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(A) NAME/KEY: CDS 

(B) LOCATION: 1..183 



(xi) SEQUENCE DESCRIPTION: SEQ- ID NO: 21: 
ATG GGT GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG 

Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val 

1 5 ■ 10 15 

CGC GCG ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT 
Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro, Arg Gly Arg Arg. 

20 ■ ... . ■ . ■ 25. . 30 

CAG CCT ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG 
Gin Pro lie Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin 
35 40 45 

CCC GGG TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TA 
Pro Gly Tyr Pro Trp Pro Leu Tyr . Gly Asn Glu Gly 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val 
1 5 10 is 

Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg 

20 25 30 

Gin. Pro He Pro Lys Ala Arg Arg Pro Glu' Gly Arg Thr Trp Ala Gin 
35 40 45 



48 



96 



144 



183 
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Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
-(D) TOPOLOGY: linear 

( 11 ) MOLECULE TYPE : cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

( vii) IMMEDIATE SOURCE : 

(B) CLONE: C100NC360 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
"" (B) LOCATION: /.I.. 270 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



ATG GGT GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG GGT GTG 
Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val 



10 



15 



CGC GCG ACG AGA AAG ACT TCC GAG CGG TCG CAA CCT CGA GGT AGA CGT 
Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg 

: 20 . . : 25 ; ' 30 

CAG CCT ATC CCC AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT CAG 
Gin Pro lie Pro" Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin 



48 



96. 



144 
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CCC GGG TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG 192 
Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala 
50 55 60 

GGA TGG CTC CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA 240 
Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr 
65 70 75 80 

GAC CCC CGG CGT AGG TCG CGC AAT TTG TA 270 
Asp Pro Arg Arg Arg Ser Arg Asn Leu 

85 90 



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val 
1 5 10 15 

Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg 

20 25 30 

Gin Pro He Pro Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala Gin 
35 40 .45 

Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala 
50 55 60 

Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr 
65 70 75 80 

Asp Pro Arg Arg Arg Ser Arg Asn Leu . 

85 



WO 91/15516 




PCT/US91/02370 



126 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 
: , (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hepatitis C Virus 

(B) STRAIN: CDC 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C1NC105 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..106 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATG GGC ACG AAT CCT AAA CCT GAG AAG AAG AAC AAA CGT AAC ACC AAC 
Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 

1 . 5 : io -v is 

CGT CGC CCA CAG GAC GTC AAG TTC CCG GGT GGC GGT GAG ATC GTT GGT 
Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 25 30 

GGA GTT TTA A ' • 

Gly Val Leu 

35 .. 



(2) INFORMATION FOR SEQ ID NO: 26: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Met Gly Thr Asn Pro Lys Pro Gin Lys Lys Asn Lys Arg Asn Thr Asn 



10 



15 



Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 

20 " 25 30 

Gly Val Leu 

35 ■ : ' . : 
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APPENDIX 



CAGGCXG7 
GTCCGACAGG. 




IGGaaAAACIGGTCCCGACCC 



61 





121 




ICCGGIAIATTGCITCl 




181 




241 




301 t cgg Itui , 





351 




TTCGIAG 



421 CQSUXCCacaiRCTCTOKTO 





EACGGACCAGC 



4fl l ACIRCCCSnaiaGCCTTTGGCATTA' 





541 




601 




651 




721 




781 




TTCCllLiULriUCaGACCCGC 




8 <1 6CGTCTGCTCClGCTTGTGCAltMVTGCTACTCATa• 

raasssssBSSSSEK 




9 °1 ACCTCGXAAlAOTAATGaU^TCCCTtKrr^^. 

TGGAGCAHMGAATTAeGTTrrr* ^:!5r55?^CGCACCGTCTTGTATCCTTCCTCC 

:gtgccagaacataggaaggagc 
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APPENDIX 



961 HjllLllX.lt,, 





1081 



ACaGTGCCIC&CCTCCCCAGCaCACE^^ 




1141 ^^^^^SSE^^^rra^^rc 

'CATAAAAG 




1201 ^ cca5acTG < aA <^CAi^ 

ACIGCTOCACOTCGClOTSaCGT^ 




1261 




1321 AAIIKItKIt^CCTCTrrra^ ,,, - ■_ 




1381 ffiSffiS"^^ 





1441 ^^ s ^^ss^^^^^^ aM 

1501 ATAACCATCTCACTCC 





1Sfi l CTGTAGAGCCAGTCGTCTTCTO 

...... ouaiara! ^^ 

1621 Mtesss^ 



1681 




1741 _ 

» _____ ™ttcam^K&&£££Z££& 




TCT" 



1 ^^^^^^^ 




1861 1GGa * cai ^ iaciMSGGG ^^ 
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1921 TCGCCTOCCCAACGGTCCrSTC 



rATACCAATGTACACCAASACCrTGTGG 
ACATCTGGtTCTGGAACACC 




2041 




2101 SCAGCCTGCIGTaSCCCCGGGCCaTTTCi 





2161 




CCGCGSTGIGCaCCO^GGAGIGG 



CTAAGGCGGTGGACTTTA' 




CCCGGTGT 

•TCl'LIUiliUGTACTCCAGGGGCCACA 



ICACGSA3AACXCCSI 




2341 CTCCCAi 

C3VGSGTGICCGXCSCCGZ 





2 *°l AGGTrcrsGxacrcMccG 



C I C I GT" " 




2461 





CC 






2641 




2701 




GaGGAGGl 1U.1LIVTCCACCA 




2761 CCGSMAGaiCCCTTTT 




'TCAAGGSGGGSA6AC 



28 21 ATCTC MC TI C I 



2881 
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2941 



3001 



CCT 




2061 £££2^^ 



3121 




3181 




3241 




3301 A?rag!_aOTS 



3361 



3481 GTTTSA1 . TCTCCTAlACaCACTGGGCS 



41 ^^^^^^^^S^^ 

3601 .ssssss^^ 

3751 

3781 -SS£S£S_S^ 
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3 841' AG_A^CCCTaraCTCOTC»G^ 
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3901 



3961 CT«KS»CAATACTTKCCGGCTTCn^ . 




4021 TGATGGCTTTTACAG 




4081 




4141 




4201 ACaiCCTTGCaGCGlJm»CGCGGGC5Tt»a»^f^ 



4261 




4331 gg£g^£ggSff£S?sgg»^ ^ 





4381 . __ 

"-CITCGCCXCCCSSGSGAACCATGm 

^CCTTGGTACAAA 

4441 




4501 




4S61 





IBA^TTUITCACCGATTTTCGATTCGAGIACGGT^ 
4681 CCTGCOGCGCGGGTAIAAGGGGGTC 






4801 




4 861 GTRC CCCCL T'l' LCl 'GCGCCGAACTACR CG l ruu r^r^r^rrw^., 
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4921 




4981 



5041 EK^AGGpiO^^ 




5101 'SFKESiSSSS^^ 



5161 J5"S£»^^ 

AGiaCIGCnGGIACGaGIGaCTAGSGAGGGlM A lIGl'C G I CT CC G C CGGCCCGCTTCCA 



5221 JS^S* 68 ^^ 




5281 gsigggg^ 




5341 £g£S^ 

S401 ;E5S255^^ 



5461 TC ^aSaAA TCCICCGC;^ ■ ■■■ »JJ 5rq r 





ccGGCCiisxAiiGGas^ 
5581 S££"£^^ 





5701 SSSHSSS^ 

5761 ^g^coro^^c^cpxgici^ 



GGGACCICarCTCG(»CCCC3^CCTAC^ 
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5881 GICAGSC CRAC GCgGACCATCICS 4.U lUwIUC UJsT 
CACICCCGIIGCGCCTCCIAQGCACAanCGAGTlS 



Tt 



5941 TCGTCACCCCGTGCGCCGC 

AGCAGTGGGGCclCGCGGCGCCZTCZTGTCTT 



6001 TGCT^CTCACTAC^TTTGGTGra^ 



6061 AGAAaGT CaGg TTGAC^ACTGGUVGTTClGSnCaTC^ 
TCITTCAGTCIAAACTGTCTGACGITCAAGA^^ 



6121 



TCC^'ri^ICGCCGCAGTTITGCTTCCGaiStaVACGATAGGCATC 



rTGC^ATCCCTAGAGGAAGCTTGCA 

XCC1TLUAACGT 



6181 GCCTGACSCCCCCACACTCttra 



6241 



IGGAAASACCTXCZGGAASACA 



6301 




6421 ssss^^ 



GGTTCITITGGG^ 

6601 Sff^Pj:*?^^ 

■^GS^IACAACACTGGAGCTGGGGSTTCGGG 

6661 SSS^^ 

rrccc 



6721 CGGSGG , 
CCCCG 



CTGTTGATCGACAC 



6781 SXSSHSFTS^ 



6841 ACTGCACp L T GC TCGTGTGTGGCgACGACTTASTOSTT. 



ATCTGTGAAAGCGCGGGGGTCC 
TAGACACTTTCGCGCCCCCAGG 
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OS 
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7201 TTA6CGTCCTTA' 
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IT IS CLAIMED: 



1. 



A recombinant polypeptide which, is immunoreactive 
with sera, from humans infected, with hepatitis C virus 
(HCV) . 

5 . 

2. The polypeptide of claim 1, which includes an 
immunoreactive. portion of an HCV polypeptide which: 

a) is encoded by. an HCV coding sequence; 

b) has 504 amino acid residues; and 

10 c) has the carboxy-terminal" sequence presented as SEQ 

ID NO: 4. - . 



3. The polypeptide of claim 2, which includes the 
entire 504 residue peptide. 



4. The polypeptide of claim 3 , whose carboxy- 
terminal amino acid sequence is encoded by the polynucleo- 
tide sequence presented as SEQ ID NO: 3. 

20 5 " The Polypeptide of claim 3, which is produced by 

the expression vector contained in. an Escherichia coli host 
identified by ATCC No. 40901. . . - . - 



6. The polypeptide of claim i, which includes an 
immunoreactive portion of the peptide sequence presented as 
SEQ ID N0:2. 

7. The polypeptide of claim 6, which includes the. 
specified sequence. 

8. The polypeptide' of claim 7, which is encoded by 
the polynucleotide sequence presented as SEQ ID N0:l. 
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9. The polypeptide of claim 7, which is produced by 
the expression vector contained in an Escherichia coli host 
identified by ATCC No. 40893. 

10. The polypeptide of claim 1, which includes the 
peptide sequence presented as SEQ ID NO: 8. 

11. The polypeptide of claim 10, which is encoded by 
the polynucleotide sequence presented as SEQ ID NO: 7. 

12. The polypeptide of claim 1, which includes the 
peptide sequence presented as SEQ ID NO: 10. 

13. The polypeptide of claim 12, which is encoded by 
15 the polynucleotide sequence presented as SEQ ID NO: 9. 

14. The polypeptide of claim. 1, which includes a 
peptide sequence selected from the group consisting of the 
peptide sequences presented as SEQ ID NO: 12, SEQ ID NO: 14/ 

20 SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ 

ID NO: 24, and SEQ ID NO:26. 

15. The polypeptide of claim 1, which is encoded by 
a polynucleotide sequence selected from the group of poly- 

25 nucleotide sequences presented as SEQ ID NO: 11, SEQ ID 
NO:13, SEQ ID NO:15, SEQ ID . NO: 17, " SEQ ID NO:19, SEQ ID 
N0:21, SEQ ID NO:23, and SEQ ID NO: 25. 

16. A diagnostic kit for use in screening human blood 
30 containing antibodies specific against hepatitis C virus 

(HCV) infection comprising 

at least one peptide antigen which is immunoreactive 
with sera from humans, infected with hepatitis C virus 
(HCV) , and 

35 . means for detecting "the binding of said antibodies to 

the antigen. - - "• — t~ : — — - 
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17. The kit of claim 16, wherein the antigen includes 
an immunoreactive portion of an HCV polypeptide sequence 
which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 

c) has the carboxy-terminal sequence presented as SEQ 
ID NO: 4. 

18. The kit of claim 16, wherein the antigen includes 
an immunoreactive portion of the peptide sequence presented 
as SEQ ID NO: 2 . 



19. The kit of claim 16, wherein the antigen includes 
an immunoreactive portion of the peptide sequence presented 

15 as SEQ ID NO: 8. 

20. The kit of claim 19, which further contains a 
second , antigen, wherein the second antigen includes an 
unmunoreactive portion of a peptide sequence selected from 

20 the group consisting of the peptide sequences presented as 
SEQ ID N0:12, SEQ ID N0:14, SEQ ID NO:16, SEQ ID NO:18, SEQ 
ID NO:20, SEQ ID NO:22, SEQ ID NO:24, and SEQ ID N0:26 



21. The kit of claim 16, wherein. the antigen includes 
an immunoreactive portion of the peptide sequence presented 
as SEQ ID NO: 10. 

22. The kit of claim 16, wherein the antigen includes 
an immunoreactive portion of a peptide sequence selected 
from the group consisting of the peptide sequences present- 
ed as SEQ ID N0:12, SEQ ID N0:14, SEQ ID N0:16, SEQ ID 
N0:18, SEQ ID NO:20, SEQ ID N0:22, SEQ ID NO:24, and SEQ ID 
N0:26. 

35 23. The kit of claim 16, wherein said detecting means 

includes.a solid support to which said peptideis attached 
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and a reporter-labeled anti-human antibody, wherein binding 

of said serum antibodies to said antigen can be detected by 
binding of the reporter-labeled antibody to said solid 
surface. 

5 

24. A method of detecting hepatitis C virus (HCV) 
infection in an individual comprising 

reacting serum from an HCV- infected test individual 
with at least one peptide antigen which is immunoreactive 
10 with sera "from humans infected with hepatitis C. virus 
* - (HCV) , and. 

examining the antigen for the presence of bound 
antibody. 

15 25 The method of claim- 24, wherein the peptide 

antigen includes an immunoreactive portion of an HCV 
peptide which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 

20 • c ) has the carboxy-terminal sequence presented as SEQ 

ID NO: 4. . 

26. The method of claim 24/ wherein the peptide 
antigen includes an immunoreactive portion of the sequence 

25 presented as SEQ ID NO: 2. 

27. The method of claim 24, wherein the peptide 
antigen includes an immunoreactive portion of the sequence 
presented as SEQ ID NO: 8. 

28. The method of claim 24, which further contains a 
second antigen, wherein the second antigen includes an 
immunoreactive portion of a peptide sequence selected from 

Jthe group consisting of the peptide sequences presented as 

35 SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ 
'~~ -^ID-NOr20r~SEQ ID NO: 22 , SEQ ID NO : 24 , and *SEQ ID NO: 26 . 
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29. The method of claim 24, wherein the peptide 
antigen includes an immunoreactive portion of the sequence 
presented as SEQ ID NO: 10. * 

5 • 30 * The method of claim 24, wherein the antigen 

includes, an immunoreactive portion of a peptide sequence 
selected from the group consisting of the peptide sequences 
presented as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ 
ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO: 24, and SEQ 
10 ID N0:26. ' : / ■ . : ■ * ,,. •. 

,31. The method of claim 24, wherein the peptide 
antigen is attached to a solid support, said reacting 

includes reacting the peptide antigen with the support, and 
15 subsequently: reacting the support with a reporter-labeled 
anti-human antibody, and said examining includes detecting 
the presence of reporter- labeled antibody on the solid 
.support. 

20 32 * A method of producing a polypeptide which is 

immunoreactive with sera from humans infected with hepati- 
tis C virus (HCV), comprising . 

introducing into a suitable host, a recombinant 
expression system containing an open reading frame (ORF) 
having a polynucleotide sequence which encodes a polypep- 
tide which is immunoreactive with sera from humans infected 
with hepatitis C virus (HCV), where the vector is designed 
to express the ORF in said host, and 

culturing said host under conditions resulting in the 
30 expression of the ORF sequence. 

33. The method of claim 32, wherein the polypeptide 
includes an immunoreactive portion of an HCV polypeptide 
which : 



a ) is encoded by an HCV coding sequence ; 

b ) . , has 5 04 amino .acid -residues , '-and — 



WO 91/15516 PCT/US91/02370 



133 

c) has the carboxy-terminal sequence presented as SEQ 
ID NO: 4; and 

where the carboxy-terminal amino acid sequence of said 
peptide antigen is encoded by the polynucleotide sequence 
5 presented as SEQ ID NO: 3. 

34. The method of claim 33, wherein the expression 
vector is a lambda gtn phage vector, the host is E. coli 
and the host containing the introduced vector is identified 

10 by ATCC No. 40901. 

35. The method of claim 32, wherein the polypeptide 
has the sequence presented as SEQ ID NO: 2, and the 
polynucleotide has the sequence presented in SEQ ID NO: I. 

• 15 ■ ' 

1 * 

36. The method of claim 35, wherein the expression 
vector is a lambda gtll phage vector, the host. is E . coli, 
and the host containing the introduced vector is identified 
by ATCC No. 40893. 

37. The method of claim 32, wherein the polypeptide' 
has the sequence presented as SEQ ID NO: 8, and the polynu- 
cleotide has the sequence presented as SEQ ID NO: 7. 

38. The method of claim 37, wherein the expression 
vector is a lambda gtll phage vector, the host is E. coli, 
and the host containing the introduced vector is identified 
by ATCC No. 4 0792. . 

39. The method of claim 32, wherein the polypeptide 
has the sequence presented as SEQ ID NO: 10, and the 
polynucleotide has the sequence presented as SEQ ID NO: 9. 
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40. The method of claim 39, wherein the expression 
vector is a lambda gtll phage vector, the host is I. coli, 
and the host containing the introduced vector is identif ied 
by ATCC No. 40876. 

41. The method of claim 32, wherein the polypeptide 
includes an immunoreactive portion of a peptide sequence 
selected from the group consisting of the peptide sequences 
presented as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ 
ID N0:18, SEQ ID N0:20, SEQ ID NO:22, SEQ ID NO:24, and SEQ 
ID NO:26. 

42 . The method of claim 41, wherein the expression 
vector is a pGEX or pET vector and the host is E. coli. 

43. An expression system for expressing a recombinant 
peptide antigen which is immunoreactive with sera from 
humans infected with hepatitis c virus (HCV) , comprising 

a host capable of supporting expression of an open 
reading frame in a selected expression vector, and 

the selected expression vector containing an open 
reading frame (ORF) having a polynucleotide sequence which 
encodes said peptide antigen. , .. . , 

44. The expression system of claim 43, wherein the 
peptide antigen includes an immunoreactive portion of an 
HCV polypeptide sequence which: 

a) is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 

c) has the carboxy-terminal sequence presented as SEQ 
ID NO : 4 ; and 

where the carboxy-terminal amino acid sequence of said 
peptide antigen is encoded ,by the polynucleotide sequence 
presented as SEQ ID NO: 3. 



35 
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45. The expression system of claim 44, wherein the 
expression vector is a lambda gt 11 phage vector, the host 

■is- .E. - coli, and the host containing the introduced vector 
is identified by ATCC No. 40901. 

5 

46. The expression system of claim 43, wherein the 
peptide antigen has the sequence presented as SEQ ID NO: 2, 
and the polynucleotide has the sequence presented as SEQ ID 
NO:l. 

io : " , : - ■ ■ ' . 

,47. The expression system of claim 46, wherein the 
expression vector is a lambda gtll phage vector, the host 
is E. coli, and the host containing the introduced vector 
is identified by ATCC No. 40893. 

15 ' • " .. 

48. The expression system of 43, wherein the peptide 
antigen has the sequence has the sequence presented as SEQ 
ID NO: 8, and the polynucleotide has the sequence presented 
as SEQ ID NO: 7. 

20 " ' . ; ; 

* ' * * i 

49. The expression system of claim 48, wherein the 
expression vector is a lambda gtll phage vector, the host 
is E. coli, and the host containing the introduced vector 
is identified by ATCC No. 40792. 

50. The expression system of 43, wherein the peptide 
antigen has the sequence has the sequence presented as SEQ 
ID NO: 10, and the polynucleotide has the sequence presented 
as SEQ ID NO : 9 . 

51. The expression system of claim 50, wherein the 
expression vector is a lambda gtll phage vector, the host 
is E. coli, .and the host containing the introduced vector 
is identified by ATCC No. 40876. 
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52. The expression system of claim 43, wherein the 

polypeptide includes an immunoreactive portion of a peptide 

sequence selected from the group consisting of the peptide 

sequences presented as SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 

5 NO: 16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
NO:24, and SEQ ID NO:26. 

53. The expression system of claim 52, wherein the 
expression vector is a pGEX or pET vector and the host is 

10 E. coll. •• - 

54. A polynucleotide which encodes a polypeptide 
which is immunoreactive with sera from humans infected with. 



hepatitis C virus (HCV) . 



55. The polynucleotide of claim 54, wherein the 
polypeptide includes an immunoreactive portion of a peptide 
sequence which: 

a) , is encoded by an HCV coding sequence; 

b) has 504 amino acid residues; and 

c) has the carboxy-terminal sequence presented as SEQ 
ID NO:4; and 

where the carboxy-terminal amino acid sequence of said 
peptide antigen is encoded by the polynucleotide sequence 
25 presented as. SEQ ID NO: 3. 

56. The polynucleotide of claim 54, which includes 
the polynucleotide sequence presented as SEQ ID N0:1. 



57. The polynucleotide of claim 54, which includes 
the polynucleotide sequence presented as SEQ ID NO: 9. 

58 . The polynucleotide of claim 54 , which includes a 
polynucleotide sequence _ selected from the group of polynu- 
cleotide sequences presented as SEQ ID NO: 11, SEQ ID NO: 13, 
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SEQ ID NO:15, SEQ ID.N0:17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ 
ID NO: 23, and SEQ ID NO: 25. 
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GAA TTC CTC GTG CAA GCG TGG AAG TCC AAG AAA ACC CCA ATG GGG TTC 

TCG TAT GAT ACC CGC TGC TTT .GAC TCC ACA GTC ACT GAG AGC GAC ATC 

CGT ACG GAG GAG GCA ATC TAC CAA TGT TGT GAC CTC GAC CCC CAA GCC 

CGC GTG GCC ATC AAG TCC CTC ACC GAG AGG CTT TAT GTT GGG GGC CCT 

CTT ACC AAT TCA AGG GGG GAG AAC TGC GGC TAT CGC AGG TGC CGC GCG 

AGC GGC GTA CTG ACA ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC 

AAG GCC CGG GCA GCC TGT CGA GCC GCA GGG CTC CAG GAC TGC ACC ATG 

CTC GTG TGT GGC GAC GAC TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC 

CAG GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG GCT ATG ACC AGG 

TAC TCC GCC CCC CCC GGG. GAC CCC CCA CAA CCA GAA TAC GAC TTG GAG 

CTC ATA' ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC GAC GGC GCT 

GGA AAG AGG GTC TAC TAC CTC ACC CGG 

R9 «^ _ a 3 
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GAA TTC TTC ACA GAA TTG GAC GGG GTG CGC CTA CAT AGG TTT GCG CCC 
CCC TGC AAG CCC TTG CTG CGG GAG GAG GTA TCA TTC ? AGA GTA GGA CTC 
CAC GAA TAC. CCG GTA GGG TCG CAA TTA CCT TGC GAG CCC GAA CCG GAT 
. GTG GCC GTG TTG ACG TCC ATG CTC ACT GAT CCC TCC CAT ATA ACA GCA 
GAG GCG GCC GGG CGA AGG TTG GCG AGG GGA TCA CCC CCC TCT GTG GCC 
AGC TCC TCG GCT AGC CAG CTA TCC GCT CCA TCT CTC AAG GCA ACT TGC 
ACC GCT AAC CAT GAC TCC CCT GAT GCT GAG CTC ATA GAG GCC AAC CTC 
CTA TGG AGG CAG GAG ATG GGC GGC AAC ATC ACC AGG GTT GAG TCA GAA 
AAC AAA GTG GTG ATT CTG GAC TCC TTC GAT CCG CTT GTG GCG GAG GAG 
GAC GAG CGG GAG ATC TCC GTA CCC GCA GAA ATC CTG CGG AAG TCT CGG 
AGA TTC GCC CAG GCC CTG CCC GTT TGG GCG CGG CCG GAC TAT AAC CCC 
CCG CTA GTG GAG ACG TGG AAA AAG CCC GAC TAC GAA CCA CCT GTG GTC 
CAT GGC TGT CCG CTT CCA CCT CCA AAG TCC CCT CCT GTG CCT CCG CCT 
CGG AAG AAG CGG ACG GTG GTC CTC ACT GAA TCA ACC CTA TCT ACT GCC 
TTG GCC GAG CTC GCC ACC AGA AGC TTT GGC AGC TCC TCA ACT TCC GGC 
ATT ACG GGC GAC AAT ACG ACA ACA . TCC TCT GAG CCC GCC CCT TCT GGC 
TGC CCC CCC GAC TCC GAC GCT GAG TCC TAT TCC TCC ATG CCC CCC CTG 
GAG GGG GAG CCT GGG GAT CCG . GAT CTT AGC GAC GGG TCA TGG TCA ACG 
GTC AGT AGT G AG GCC AAC GCG GAG GAT GT C GTG TGC TGC TCA ATG TCT 
TAC TCT TGG ACA GGC GCA CTC GTC ACC CCG TGC GCC GCG GAA GAA CAG 
AAA CTG CCC ATC AAT GCA CTA AGC AAC TCG TTG CTA CGT CAC CAC AAT 
TTG GTG TAT TCC ACC ACC TCA CGC AGT GCT TGC CAA AGG CAG AAG AAA 
GTC ACA TTT GAC AGA CTG CAA GTT CTG GAC AGC CAT TAC CAG GAC GTA 
CTC AAG GAG GTT AAA GCA GCG GCG TCA AAA GTG AAG GCT AAC TTG CTA 
TCC GTA GAG GAA GCT TGC AGC CTG ACG CCC CCA CAC TCA GCC AAA TCC^ 
AAG TTT GGT TAT GGG GCA AAA GAC GTC CGT TGC CAT GCC AGA AAG GCC 
GTA ACC CAC ATC AAC TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA 
ACA CCA ATA GAC ACT ACC ATC ATG GCT AAG AAC GAG "GTT TTC TGC GTT 
CAG CCT GAG AAG GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC 
GAT CTG GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG. TAC GAC GTG GTT 
ACC AAG CTC CCC TTG GCC GTG ATG GGA AGC TCC TAC GGA TTC CAA TAC 
TCA CCA GGA CAG CGG GTT GAA TTC ' 

Pig. 4 
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MS T T G .EI P F Y G K A I P . L E 
CC ACC ACC GGA GAG ATC CCT. TTT TAC GGC AAG GCT ATC CCC CTC GAA 
V I K G G R H L I F C H S K K K 
GTA ATC AAG GGG GGG AGA CAT CTC ATC TTC TGT . CAT TCA AAG AAG AAG 

C D E L A A K L V A L G I, N A V 
TGC GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG 

A Y Y R . G L D V S V I P T S G D 
GCC TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC GAT 



V 



M 



GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC GGC GAC 



N 



V 



TTC GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT 



TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC ACG CTC CCC CAG 

D A v S R T Q R R G R T GBR G K 
GAT GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG ACT GGC AGG GGG AAG 

P G I Y R F V A P G E R , P S G M 
CCA GGC ATC TAC AGA TTT GTG GCA CCG GGG GAG CGC CCC TCC GGC ATG 

F . D S S V L C E . C Y D A G C A W 
TTC GAC TCG TCC GTC CTC TGT- GAG TGC TAT GAC GCA GGC TGT GCT TGG 

Y E L A T P A E T T V R L R A. Y M 
TAT GAG CTC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGA GCG TAC ATG 



N 



D * 



AAC ACC CCG GGG CTT CCC GTG TGC CAG GAC 



Fig. 5 
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Epitope Comparison/Delineation 



225 



2754 



409.1.1 



(557bp) 



4 09.1.1" F1R2 (270bp) 

409.1.1 (a) (375bp) 

3129 



409.1.1 (ca) (445bp) 



3 024 

Y Clone 33C 
(799bp) 



Clone 33C-TT 
(529bp) 



3311 



3129 



409.1.1 (u) (105bp) 



409.1.1 (C) (70bp) 



409.1.1 (c+270) 
(340bp) 



Fig. 7 
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GGG 

1 CC ATG GGC ACG ATT CCC AAA CCT CAA AAA AAA AAC AAA CGT 

M G T I P K P Q K K N K R 
43 AAC ACC AAC CGT CGC CCA CGG GAC GTC AAG TTC CCG GGT GGC 

N T N R R p R D V K F- P G G 
85 GGT CAG ATC GTT GGT GGA GTT TAC TTG TTG CCG CGC AGG GGC 

6 Q 1 V G G V Y - L P R R G 

127 CCT AGA TTG GGT GTG CGC GCG ACG AGA AAG ACT TCC GAG CGG 

P R L G V R A T R K T S E R 
169 TCG CAA CCT CGA GGT AGA CGT CAG CCT ATC CCC AAG GCT CGT 

S Q P R G R R Q P I p K A R 
211 CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG TAC CCT TGG 

R P E G R TWA Q P G Y p W 
253 CCC CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG CTC 

P L Y G N E G C G W A G W L 
295 CTG TCT CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC 

L S P R GSR P S W G P T D 
337 CCC CGG CGT AGG TCG CGC AAT TTG GGT AAG GTC ATC GAT ACC 

P R R R S R N L G K V I D T 
379 CTT ACG TGC GGC TTC GCC GAC CTC ATG GGG TAC ATA CCG CTC 

L T C G F A D L M G Y I p L 
421 GTC GGC GCC CCT CTT GGA GGC GCT GCC AGG GCC CTG GCG CAT 

V G A P L G G A A R A L A H 
463 GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GGA ACA. GGG 

G V R V LED G V N Y A T G 
505 AAC CTT CCT GGT TGC TCT TTC TCT ATC TTC CTT CTG GCC CTG 

N L P G C S F S I F L L A L 
547 CTC TCT TGC TTG ACT GTG CCC GCT TCG GCC TAC CAA GTG CGC 

L S c L T V P A S A Y Q V R 
589 AAC TCC ACG GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT AAC 

N S T G L Y H V T N D C P N 
631 TCG AGC ATT GTG . TAC GAG TAA TAG GGA TCC 659 
S S I V Y E Z Z G S 

Fig; 8 A 
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10 20 30 40 50 so 

' «' ■ i, . '■ :. i \ \ 

AAA 
(wild Type) 

Primer CI 

METGlyThrAsnProLysProGlnLysLysAsnLysArgAsnThrAsnArgArgProGln 

70 80 90 100 110 120 

1 1 : i ! ! ! 

GACGTGAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGTTTACTTGTTGCCGCGCAGG 

qaacceat-< jaatGGAGrrTACTTGTTGPf; ->> 

Primer C100 




AspValLysPheProGlyGlyGlyGlnlleValGlyGlyValTyrLeuLeuProArgArg 

130 . 140 150 160 170 180 

■ »* J i r i 

GGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGAGCGGTeGCAACCTCGAGGT 

GlyProArgl^uGlyValArgAlaThrArgLysThrSerGluArgSerGl^ 

190 200 210 220 230 240 

V «■ i I ! | 

AGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAGGACCTGGGCTCAGCCCGGG 

ArgArgGlnProIleProLysAlaArgArgProGluGlyArgThrTrpAlaGlnProGly 

250 260 270 280 290 300 

' ' i i ! | 

TACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGCGGGATGGCTCCTGTCTCCC 

qaaccea- tGGGOTn^GGGTGGGgGGG 

Primer C270-> : 

<— CTCTATGGgAATnA GGGfyPaafTrratccaaee 

<- Primer NC270 — 
TyrProTrpProLeuTyrGlyAsnGluGlyCysGlyTrpAlaGlyTrpLeuLeuSerPro 



Fig. 8B 
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+ * * * * * - * 

310 320 330 340 350 360 

! ! i i l I 

l l l l l i 

CGTGGCTCTCGGCCTAGCTGGGGCCCCACA6ACCCCCGGCGTAGGTCGCGCAATTTGGGT 

aaacccaTGGGT 
(C360->) 

< — GGCGTAGGTCGCGCAATTTGtaa 

oPrimer NC360 

ggatccggcc 

J^gGlySerArgProSerTrpGlyProThrAspProArgArgArgSerArgAsiiI^uGly 

370 380 390 400 410 420 

| j i i i i 

i i i i i i 

AAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCATGGGGTACATACCGCTCGTC 
AAGGTCATCGATACC 

Primer C3 6 0-> 

LysVallleAspThrLeuThrCysGlyPheAlaAspLeuMETGlyTyrlleProLeuVal 
430 440 450 460 470 480 

. J j i i i i • 

i i i i i i 

GGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCGGGTTCTGGAAGAC 

< — GGAGGCGCTGCCAGGGCCtaaggatccggcc 
< — Primer NC450 

GlyAlaProteuGlyGlyAlaAlaArgAlaLeu^ 

490 500 510 520 530 540 

" ' ' r '■■*'■ •. i :* " - i i i . i 

r i i i i i 

GGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTTCTCTATCTTC 

< — GCAACAGGGAACCTTCCTGGTTaaggatccggcc 

< — Primer NC520 

GlyValAsnTyrAlaTlirGlyAsnl^uProGlyCysSerPheSerllePheLeuLeuAla 

550 560 570 580 590 600 

-l '• ■ ■ ■ I • I I I l 

I I I I I I 

CTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCT 

< — CTGTGCCCGCTTCGGCCTAaggatccggcc 
< — Primer NC580 

LeuLeuSerCysLeuThrValProMaSerAlaTyrGl^ 

610 620 630 640 650 

i i i i | 

TACCACGTcAcCAATGATTGCCCTAACTCGAGCA 

< — GTACGAGTAATAGGGATCCgaa 
< — Primer NC660 
(3 •primer) 

TyrHisValThrAsnAspCysProAsnSerSerlleValTyrGlu GlySer 



Fig. 8B (con't) 
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AMINO ACID RESIDUES 




200*- 3 

NC660 

.9 




C1NC450 
C1NC3 60 
C1NC270 

C1NC105 

C100NC270 
C100NC3 60 

C270NC360 
C270NC450 



Pig— 1-0 



IMMUNORE ACTIVE 
NONRE ACTIVE 



TRANSMEMBRANE DOMAIN 



